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A  categorical  analysis  was  made  of  five  languages.  The 
sounds  of  speech  were  simulated  using  written  text  converted 
via  International  Phonetic  Alphabet  (IPA) .  The  sounds  of 
speech  were  identified  as  members  of  fricative,  nasal,  stop, 
or  vowel  categories.  A  statistical  analysis  was  performed 
on  categorical  content  of  one  (at  various  positions  in  the 
word) ,  two,  and  three  sound  combinations. 

Several  attempts  to  achieve  a  differentiation  scheme  were 
made  before  any  success  was  realized.  Two  methods  for 
developing  conditional  expectation  are  compared;  Bayes* 
Conditional  Probability  Rule,  and  Cook’s  Prognostic 
Progression. 

Statistical  analysis  and  "loop1’  tests  indicated  that 
languages  do  have  unique  patterns  and  can  be  differentiated 
on  the  statistics  contained  in  the  first  three  sounds.  1003 
correct  decisions  were  achieved  for  as  few  as  five  words  in 
the  loop  test.  Limited  base  data  negated  result  significance 
beyond  three  successive  sounds. 
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I. 


INTRODUCTION 


A.  LANGUAGE  PROBLEMS,  GENERAL 

There  has  existed  for  a  great  many  years  the  desire  and 
requirement  to  differentiate  languages  both  categorically 
and  specifically.  It  is  relatively  easy  to  accept  that  a 
person  can  mentally  detect  an  unknown  foreign  language  and 
guess  (often  accurately)  its  origin.  However,  this  mental 
process  has  not  been  successfully  duplicated  using 
mechanical/electrical  devices  is  spite  of  research  efforts. 

Language  barriers  have  historically  been  major  opponents 
of  international  well-being.  Interpreters  bridged  many 
gaps,  however,  the  lack  of  multi-linguists  necessitated  the 
need  for  a  large  number  of  bi-lingual  specialists. 
Therefore,  a  pool  of  interpreters  is  required  to  conduct  the 
routine  business  of  international  relations.  Linguist 
selection  is  simplified  for  some  physical  charac teristics, 
however,  it  would  be  quite  embarrassing  to  select  a  Chinese 
translator  for  a  Korean  dignitary.  Modern  voice 
communications  eliminate  all  physical  characteristics  except 
those  representative  of  the  speaker  and  his  language.  The 
question  then,  was  whether  the  sound  content  itself 
contained  sufficient  information  of  a  unique  quality 
(language  dependent)  so  as  to  make  it  distinguishable  from 
all  other  possibilities.  Certainly  it  is  desirable  to 
establish  communications  expediently  and  correctly. 

Languages  are  coded  utterances,  therefore,  it  seems 
possible  to  differentiate  the  languages  by  recognizing  the 
type  and  quality  of  code  being  used.  A  •'guess'*  as  noted 
above  has  some  merit,  but  lacks  the  continual  precision 
available  in  a  machine  (computer) . 
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B.  WRITTEN  LANGUAGES 


Written  communications  have  been  developed  based  on 
either  the  interpretation  of  the  coded  sound  or  a  picture 
representing  the  item  one  desires  to  convey.  Not 
surprisingly  there  has  been  a  move  in  the  interest  of 
internationalism  to  convert  the  pictorial  forms  of  some 
written  languages  to  the  Romanized  script.  Most  apparent 
was  the  recent  introduction  of  •’PINYIN"  to  the  Chinese 
language  [Ref.  1].  One  intent  of  the  conversion  is  the 
introduction  of  phonetics  to  the  school  systems  to  establish 
proper  pronunciation.  The  accepted  international  languages 
are  English,  French,  and  Spanish.  Since  these  languages  are 
Romanized,  the  conversion  from  pictorial  to  Roman  suggests 
that  future  generations  will  be  less  burdened  in  mastering  a 
second  language.  In  the  conversion  the  ideograms  are 
phonetically  transcribed  using  the  IPA  (discussed  in  section 
II)  . 


Phonetics,  then,  is  a  basis  through  which  sounds  are 
transcripted  and  establishes  a  common  base  between  the 
spoken  and  written  word.  It  is  the  purpose  of  this  paper  to 
establish  the  feasibility  of  automatically  differentiating 
languages  based  on  the  average  phonemic  content  over  various 
input  lengths.  The  source  data  used  was  various  phonetic 
transcriptions  of  the  selected  test  languages.  The 
reference  data  is  based  on  transcripted  word  frequency  (for 
usage)  lists,  some  text  selections,  and  conversation  samples 
for  each  of  the  selected  test  languages.  The  reference  data 
is  used  to  establish  matrices  of  probability  which  measure 
the  sound  types  and  sequences  in  languages. 
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c. 


SPEECH 


Speech,  as  stated  earlier,  is  any  length  of  coded 
utterances.  An  utterance  is  herein  defined  as  the  forced 
emission  of  modulated  air  from  an  oral  or  nasal  orifice. 
Coding  implies  that  some  means  is  applied  to  modify  the  air 
from  its  zero  state.  The  zero  state  is  defined  as 
unmodulated  inhaled  or  exhaled  air,  i.e.,  normal  breathing 
patterns.  The  modification  or  modulation  of  the  air  is 
achieved  by  the  mechanisms  of  speech  to  produce  audible 
sound  waves.  The  mechanics  of  speech  are  discussed  in  some 
detail  in  Appendices  A  and  B. 

The  coding  or  modulation  of  air  flow  is  classified  into 
the  major  categories:  vowels  (V)  ,  stops  (S)  ,  fricatives 
(F) ,  and  nasals  (N) .  These  categories  are  further 
subclassified  into  individual  sounds  which  are  unigue  in 
their  generation  and  in  the  characteristics  of  the  modulated 
pressure  wave  output.  A  more  detailed  description  of  these 
categories  can  be  found  in  section  2  of  Appendix  A. 

The  uniqueness  of  certain  sounds  has  been  a  detrimental 
aspect  to  experimenters  in  the  area  of  synthetic  voice 
generation.  Conversely,  certain  categories  have  striking 
similiarities  which  allow  simulator  redundancy.  In  the 
vowel  category.  Dr.  Rosenblum  [Ref.  2]  demonstrated  how  one 
vowel  sound  could  be  converted  directly  to  another  by  means 
of  selective  formant  filtering.  Other  works  with  specific 
sound  categories  are  discussed  in  section  3  of  Appendix  A. 

The  pause  is  a  special  case  of  coded  output,  and  is  an 
essential  element  in  speech  analysis.  Potter,  Kopp,  and 
Kopp  [Ref.  3]  interpreted  long  strings  of  sounds  as  recorded 
on  a  spectrogram  after  they  had  first  recorded  the  series 
"word — pause — word — pause"  spectrograms  and  were  sure  where 
the  word  boundaries  occurred.  In  speech  the  pauses  provide 


VI 


a  starting  and  ending  point  for  analysis.  The  time  between 
pauses  is  random,  as  is  the  actual  length  of  the  pause. 

The  data  between  the  pauses  is  composed  of  sounds  from 
the  major  categories,  and  for  this  analysis  it  was  assumed 
that  the  sounds  could  be  identified  at  least  categorically, 
and  removed  for  analysis.  The  actual  conversion  of  spoken 
speech  into  a  form  for  analysis  was  accomplished  indirectly 
via  phonetic  transcriptions  which  is  the  subject  of  part 
III. 
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II.  ANALYSIS  TECHNIQUES 


In  the  90th  meeting  of  the  Acoustic  Society  of  America, 
150  different  papers  were  presented  on  various  research 
efforts  being  done  in  the  voice/speech  field  [Ref.  4],  The 
speakers  varied  from  language  teachers,  phoneticians,  etc. , 
to  medical  doctors,  psychologists,  physicists,  and 
engineers.  They  represented  various  universities,  and 
private  organizations  from  the  United  States  and  abroad. 
There  were  several  papers  which  looked  into  differences  in 
languages,  but  were  microscopic  in  nature.  They  were 
concerned  with  the  specific  functions  of  the  system  which 
made  certain  vowel  or  consonant  sounds  in  one  language  vary 
from  another,  i.e.,  why  a  palatilized  "i"  sound  varied  from 
the  non-palatilized  "i.,r  Sore  specifically  is  it  only  the 
palatilizat icn  or  are  other  factors  involved.  For  example 
some  looked  at  sound  clusters  of  vowel-consonant 
combinations  (VCVVCCV) ,  and  focused  on  the  fact  that 
certainly  a  palatalized  ,,i"  is  forced  by  the  sound 
preceeding  or  succeeding  the  "i"  sound.  Other  studies  were 
slanted  toward  speaker  recognition  schemes.  Some  were  done 
in  languages  other  than  their  own,  while  others  looked  at 
recognition  criteria  in  their  native  languages.  These 
latter  results  in  general  hinted  that  language  dependent 
factors  do  exist.  Note  that  when  transcriptions  are  used 
speaker  dependence  is  somewhat  removed,  i.e.,  the  method  of 
speaking  (dialect)  is  retained,  however,  the  properties 
unique  to  the  speaker  (fundamental  frequency,  muscle 
impairment,  nose  size  and  shape,  etc.)  are  absent. 
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III. 


PHONETIC  TRANSCRIPTIONS 


A.  HISTORY 

1 •  General  Background 

Using  symbols  as  a  means  of  communications  goes 
back  through  several  civilizations.  It  is  almost  academic 
to  determine  whether  hieroglyphic  styles  came  prior  to, 
concurrently  or  after  vocalized  descriptions.  The  fact  is 
that  a  means  to  convey  thought  other  than  by  vocalization 
was  required,  and  developed.  The  various  alphabets  used 
throughout  the  world  have  certainly  proven  themselves  able 
vehicles  inside  of  their  boundaries,  but  simultaneously 
represent  an  additional  hurdle  to  commonality  across 
language  boundaries.  There  are  approximately  thirty 
different  alphabets  currently  being  used  throughout  the 
world.  It  was  not  documented  until  the  ninteenth  century 
that  the  presently  used  alphabets  were  deemed  to  be 
deficient.  Potter,  Kopp,  and  Kopp  [Ref.  3,  p.  1]  point  out 
that  Melvin  Bell  (father  of  Alexander  Graham  Bell)  developed 
a  set  of  symbols  which  specified  the  pronunciation  of  words 
so  accurately,  that  anyone  familiar  with  the  system  could 
repeat  the  word  precisely.  Bell’s  contribution  was 
primarily  aimed  as  an  aid  for  the  deaf,  although  it  also 
made  a  definite  impact  on  the  fields  of  language,  phonetics, 
and  the  electronic  analysis  of  speech;  e. g. ,  the  sound 
spectrograph  was  certainly  a  by-product  of  the  search  for  a 
visible  means  to  convey  what  is  heard.  A  brief  description 
of  the  sound  spectrograph  is  given  in  Appendix  A. 

2.  Development  of  the  IPA 

Language  teachers,  in  the  mid  1800's,  recognized 
certain  limitations  in  their  ability  to  teach  proper 
pronunciation  using  traditional  alphabets.  Many  teachers 
circumvented  these  limitations  by  developing  symbols  to 
describe  the  type  of  sound  required.  As  communications  and 
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transportation  technology  bridged  the  gaps  between  teachers 
they  began  to  discover  that  others  were  having  problems 
similiar  to  their  own,  thus  informal  teacher  collectives 
were  formed.  The  International  Phonetic  Association  [Ref. 
5]  attribute  their  origin  largely  to  the  efforts  of  Paul 
Passy  who  headed  a  small  group  of  language  teachers  in 
France  in  1886.  Passy  and  his  colleagues  found  that  the  26 
characters  in  the  Roman  alphabet  were  largely  inadequate 
when  trying  to  teach  proper  pronunciation.  Many  sounds  for 
the  same  letter  or  combination  of  letters  were  in  fact 
generated  in  the  vocal  system  with  distinctly  different 
mechanics.  Armed  with  this  information,  they  drew  up  a  new 
alphabet  to  fulfill  their  needs.  The  new  alphabet  was 
composed  entirely  of  symbols  each  of  which  represented  a 
unique  sound.  Their  new  alphabet  was  fully  defined  and  used 
throughout  the  text  of  the  first  publication  of  the 
periodical  "Dhi  Fonetik  Titcer  [Ref.  5]."  Others  were  drawn 
by  this  unique  idea  and  subsequent  events  resulted  in  the 
formation  of  the  International  Phonetic  Association.  The 
work  of  Samuel  Bell  was  done  independently  of  the  European 
group,  and  his  symbols  actualy  resembled  hieroglyphic 
techniques,  rather  than  an  expanded  version  of  the  Roman 
alphabet,  which  was  the  objective  of  Passy.  Therefore, 
Bell's  alphabet  faded  with  time.  The  association  grew  as 
numerous  inputs  were  made  from  various  other  countries  and 
field-phoneticians.  Eventually  a  collection  of  phonetic 
transcriptions  of  languages  (English  et  al)  were  assembled, 
and  the  IPA  was  expanded  to  include  sounds  unique  to 
previously  unavailable  language  transcriptions.  This  work 
across  language  boundaries  enabled  many  languages  to  be 
taught  much  more  easily  both  in  native  and  second  languages. 
Heretofore,  languages  could  be  described  by  the  basic 
parameters  as  usual  using  the  unique  structures  and 
additionally  by  those  sound  production  techniques  unique  to 
a  specific  language. 
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In  the  IPA  every  syobol  has  a  distinct  sound  associated 
with  it.  There  are  some  89  different  symbols  encompassing 
all  but  the  African  languages  as  noted  by  Dr.  Francis 
Cartier  [Ref  6].  Only  43  of  these  symbols  appear  in 
American  English,  (depending  on  the  reference  used;  general 
range  is  42-46) ,  which  makes  its  transcription  much  easier. 
In  "The  Phonetic  Alphabet,'*  Dr.  Cartier  describes  in  detail 
the  techniques  of  transcripting  the  English  language,  and 
has  a  small  treatment  of  other  languages  and  non-sense  words 
[Ref.  6].  In  order  that  the  technique  for  this  research  be 
expandable  to  eventually  include  all  languages  a  computer 
version  was  developed  for  101  IPA  characters. 

B.  THE  TRANSCRIPTION 

A  transcription  consists  of  writing  (in  symbols)  the 
sounds  one  makes  when  speaking.  This  can  be  accomplished 
quite  effectively  (after  training)  in  either  a  native 
language  or  a  language  for  which  adequate  knowledge  of 
proper  pronunciation  is  known  first-hand.  If  the  language 
is  foreign  to  the  transcriptor  there  is  considerable  room 
for  error,  particularly  if  sessions  cannot  be  spread  so  as 
to  keep  the  speaker,  interpreter,  and  transcriptor  in  a 
fresh  state  of  mind.  If  a  transcriptor  is  transcripting 
from  an  unknown  language,  which  appears  close  to  one  he  is 
vaguely  familiar  with  he  will  tend  to  do  the  transcription 
in  the  symbols  and  forms  associated  with  the  most  familiar 
language  until  he  has  a  key  to  the  language  being 
transcribed,  then  he  will  retrace  and  make  whatever 
corrections  are  required  to  maintain  the  quality 
(uniqueness)  of  that  language  [Ref.  7],  This  could  lead  to 
an  erroneous  transcription  if  a  key  group  of  words  were  not 
located,  and/or  the  language  had  not  been  the  subject  of  a 
detailed  analysis  to  determine  which  if  any  sounds  were 
definitely  unique  to  that  language.  Another  problem 
presented  along  these  same  lines  is  dialect.  The  simple 
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word  "the"  can  be  properly  pronounced  several  ways  depending 
upon  the  geographical  area  of  the  world  in  which  the  word  is 
being  used.  Therefore,  it  is  possible  for  the  transcriptor 
to  write  0i  when  he  really  should  have  written  cfe . 

C.  SOURCES  AND  USES 

The  transcriptions  used  for  this  paper  were  obtained 
from  a  variety  of  sources.  Most  of  the  test  text  portions 
were  taken  from  "The  Principles  of  the  International 
Phonetic  Association,  1949."  There  is  a  second  group  of 
text  particularly  done  in  a  conversational  mode  type 
obtained  from  Micholin  Ponder  of  the  Research  Department  of 
the  Defense  Language  Institute,  Monterey,  California. 
Additional  samples  of  conversational  text  wa$  made  available 
by  Dr.  R.  Hohlford,  and  particularly  useful  English  samples 
were  taken  from  "Informal  Speech"  [Ref.  8],  A  sample  of 
Spanish  was  contributed  by  Dr.  Maria  Baird,  Chairman, 
Spanish  Department,  DLI,  Monterey,  Ca. 

There  are  also  several  dictionaries  which  use  the  IPA 
symbols  for  their  pronunciation  guides.  One  such  source  for 
American  English  is  "A  Pronouncing  Dictionary  of  American 
English,"  by  Kenyon  and  Kott  [Ref.  9].  Several  foreign 
dictionaries  were  scanned  as  well  as  language  textbooks,  in 
most  cases  the  IPA  was  used  sparingly,  making  it  extremely 
difficult  to  collect  a  satisfactory  representation. 


17 


IV.  DIFFERENTIATION  DEVELOPMENT 


The  actual  classification  of  specific  sounds  has 
historically  been  achieved  first  by  aural  perception,  and 
later  by  electronics  devices  such  as  the  oscilloscope,  the 
spectrograph,  and  the  computer.  All  of  these  devices 
required  manual  intervention.  That  is,  the  sounds  are 
processed  and  recorded  so  that  they  can  be  tangibly  observed 
and  interpreted  for  content,  classification,  etc.  Presently 
no  real  time  conversion  exists  for  the 
interpretation/classification  domain.  Developing  a  routine 
or  algorithm  to  directly  convert  spoken  speech  into  some 
form  of  usable  data  does  not  appear  infeasible.  There  are 
currently  research  projects  being  conducted  in  this  area. 
Hess  [Ref.  10],  although  not  specifically  oriented  to 
language  differentiation  developed  a  recognition  model  whose 
techniques  and  results  may  in  fact  be  very  applicable  to  a 
real  time  classification  converter.  For  the  purposes  of 
this  research  it  was  assumed  that  such  a  device  could  be 
produced. 

Although  it  is  not  clearly  understood  how  the  mind  can 
differentiate  languages,  it  seemed  that  a  model  could  be 
developed  to  produce  this  differentiation  function,  not  on  a 
"guess"  basis  but  with  some  reasonable  assurance  as  to  which 
language  was  being  used.  Here,  it  was  felt  that  the  sound 
content  might  be  the  key.  The  research  is  based  on  these 
assumptions  and  the  belief  that  the  IPA  represents  an 
appropriate  sound  classification  capturing  vehicle. 

A.  DATA  EASE  SELECTION 

The  first  problem  was  to  determine  a  satisfactory  data 
base  from  which  one  could  establish  the  categorical  sound 
statistics.  Considerable  research  has  been  done  on 
frequency  factors  for  use  of  words  under  such  conditions  as: 
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ordinary  (informal)  conversation,  formal  speaking  (addresses 
etc.),  informal  writing  (letters),  and  formal  writing 
(textbooks,  newspapers,  periodicals,  etc.),  and  these  lists 
are  available  in  a  number  of  languages.  After  obtaining 
such  lists  it  became  necessary  to  convert  first  into  the 
Roman  alphabet,  and  then  to  transcribe  from  the  Romanized 
script  into  the  IPA  symbology.  As  discussed  above  in  the 
transcription  section,  this  did  not  propose  an  easy  task  for 
a  non-native  novice  transcriptor.  Therefore,  solicitations 
for  assistance  particularly  from  DLI,  were  abundant  [refs. 
11,  12].  The  question  arose  concerning  the  possibility  of 
error  introduction  when  using  word  lists.  The  major  error 
would  seem  to  appear  when  the  factor  of  frequency  is  not 
utilized  within  the  list  itself,  i.e.,  if  the  frequency  of 
the  word  were  1/10  then  one  would  expect  every  tenth  word  in 
the  list  to  be  the  same.  A  further  example  is  drawn  from 
the  Markov  process.  One  such  use  of  the  Markov  process  is 
to  establish  the  frequency  of  occurrence  of  letters 
(singularly  and  in  combination)  for  the  given  language. 
Then  a  string  is  developed  using  those  frequencies  (note 
that  in  English  one  considers  27  characters  to  include  the 
pause) .  A  typical  string  would  appear  as: 

EAMI  HEI3RAGEL - etc. 

The  frequency  of  use  is  a  must  in  this  case,  and  is 
generally  determined  by:  (for  the  single  letter  case) 

Frequency  (desired  letter)  =  (desired  letter) _  (1) 

-  (letters) 

Obviously  the  string  of  characters  does  not  make  sense. 
However,  when  one  looks  at  the  strings  generated  by  the  two 
letter  combinations  [of  which  there  are  (27) 2  or  (n)2],  they 
may  find  some  words  (simple)  begin  to  appear.  Using  the 
three  letter  combination  level  (n) 3  more  words  begin  to 
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appear.  The  process  could  of  course  continue  ad  infinitum 
and  at  each  step  would  generate  more  complex  words  and 
perhaps  even  paragraphs.  An  analysis  very  similiar  to  the 
Markov  Chain  type,  just  described,  was  used  by  Rau  [Ref. 
13],  to  develop  a  process  by  which  short  samples  of  written 
text  could  be  automatically  differentiated.  Rau  looked  at 
two  languages,  and  achieved  results,  ranging  from  51%  to 
100%  accuracy  factors  which  were  proportional  to  the  length 
of  the  "short  samples"  he  used  [Ref,  13,  p.  21,  ff ].  In  the 
case  of  word  lists,  it  would  appear  that  by  not  at  least 
using  a  Markcv  process  something  would  be  lost.  However,  in 
the  early  testing,  the  single  listings  of  words  generated 
data  very  close  to  that  of  various  random  text  and 
conversation  inputs.  Therefore,  it  was  concluded  that  a 
significant  error  was  not  introduced  by  using  the  most 
common  words  independent  of  their  frequency  factor. 

B.  COMPUTER  LANGUAGE  DECISION 

It  was  apparent  that  data  processing  should  be 
accomplished  by  a  computer  but  the  choice  of  a  language 
(compiler)  was  not  immediately  obvious.  FORTRAN  17  was 
attempted,  but  rejected  because  of  the  conversions  required 
in  the  input/output  (I/O)  for  the  raw  data.  The  language 
SNOBOL  IV  (used  in  conjuction  with  SPITBOL)  is  probably  the 
best  language  [Ref.  14],  however,  arrangements  for  its 
installation  could  not  be  achieved  in  time  to  be  effective 
for  this  research.  ALGOL  W  contains  qualities  of  both 
FORTRAN  and  COBOL,  and  was  ultimately  selected  because  of 
its  inherent  ability  to  process  both  text  and  equations. 
The  initial  program  was  developed  by  M.  C.  Thoraas[Ref  15], 
using  strictly  English  data,  and  a  special  alphabet 
conversion  code  (ROMI)  consisting  of  the  44  IPA  sounds 
associated  with  the  English  languages.  The  program 
developed  the  statistics  for  the  first  sound,  the  first  two 
sounds,  first  three  sounds,  and  the  first  four  sounds  of  the 
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data  input.  The  statistics  were  developed  in  the  program  by 
reading  in  the  data  word  by  word,  then  converting  the  data 
into  the  sound  categories  (F,N,S,V),  via  ROMI,  letter  by 
letter.  The  converted  words  were  processed  to  extract  the 
appropriate  statistics  to  produce  the  data  shown  in  computer 
output  1. 

As  languages  outside  of  English  were  drawn  into  the  data 
base,  ROMI  had  to  be  increased,  and  was  ultimately  expanded 
to  its  present  state  of  101  characters.  The  number  of 
symbols  (101)  decided  upon  was  not  entirely  the  result  of 
the  different  languages  introduced  or  tested,  rather  the 
number  was  selected  as  being  the  universal  set  recognized  by 
the  International  Phonetic  Association  [Ref.  5],  plus  some 
diphthongs  uniquely  identified  by  Cartier  and  Todaro  [Ref. 
6].  Thus  ROMI  is  capable  of  converting  any  language 
subjected  to  the  models.  The  ROMI  to  IPA  conversion  is 
listed  by  categories  in  the  appendix  C  tables. 

Five  languages  were  selected  at  random,  from  among  those 
for  which  frequency  tables  and  translators  or  previously 
transcripted  materials  were  available.  Theoretically  the 
languages  are  independent  variables  to  any  algorithym  and 
thus  were  designated  A,  B,  C,  D,  and  E. 

C.  FIRST  SOUND  ANALYSIS 

Results  similiar  to  the  first  data  run  are  shown  in 
computer  output  1.  The  sound  pattern  of  the  first  letter 
per  word  (or  phrase,  in  the  conversational  cases)  showed 
some  promise  of  a  pattern,  even  though  they  were  bunched 
together  at  the  lower  end  of  the  scale  (see  figure  1) .  The 
words  were  separated  from  the  phrases  to  ptove  (or  disprove) 
similarity,  and  similarity  did  occur,  thus  disproving  that 
an  error  factor  was  introduced  because  of  using  word  lists 
without  frequency  considerations. 
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A  decision  table  was  drawn  up  using  ranges  established 
by  figures  1-a  and  1-b.  A  new  set  of  data  for  a  single 
language  was  then  analyzed  and  its  results  compared  with  the 
table.  The  test  data  is  indicated  by  the  X's  in  figure  2. 

The  results  of  the  single  test  indicated  that  the  sample 
size  was  inadequate.  Additionally,  it  appeared  that  the 
ranges  would  increase  proportionately  with  the  sample  size, 
unless  a  convergence  could  be  determined.  If  the  ranges 
were  to  increase,  there  would  be  an  even  greater  overlapping 
of  regions,  and  decisions  would  be  made  more  difficult. 
Therefore  it  was  concluded  that  differentiation  was  not 
feasible  on  the  basis  of  a  single  sound. 
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Figure  1.  First  Sound  Frequencies 
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Figure  2.  First  Sound  Decision  Table 
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D. 


NORMALIZATION  CROSS-SECTION 


The  second  approach  dealt  with  observing  a  cross-section 
of  the  languages  collectively  on  the  basis  of  categorical 
content.  This  method  of  analysis  presumed  that  the  sounds 
would  be  randomly  distributed  in  a  "Gaussian11  fashion.  The 
decision  factor  selected  was  the  Shannon  Theorem  [Ref.  16, 
chapter  3].  In  the  Shannon  theorem  the  data  is  assumed 
Gaussian,  and  three  situations  occur:  (1)  the  curves  are 
disjoint  and  the  decision  is  immediately  obvious;  (2)  the 
curves  totally  overlap  and  no  decision  is  possible;  or  (3) 
there  is  a  slight  overlapping  and  some  criterion  can  be 
developed  which  will  give  reasonable  predictions  of  the  case 
at  hand,  providing  the  error  factor  (amount  of  overlap)  is 
not  severe.  The  three  situations  are: 


Figure  3.  Shannon  Theorem  Decision  Curves 


It  was  anticipated  that  either  case  (1)  or  (3)  would 
emerge.  Therefore,  the  statistics  obtained  from  the 
computer  printout  were  "normalized"  or  "Gaussianized"  in 
accordance  with  the  following: 
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X  =  the  %  of  occurrence  of  a  category 
in  the  first  three  and  last  two 
sound  positions  of  a  word.  %  =>  frequency. 


(2) 


(3) 


(4) 


i  =  1,  (1/2)  ,  (1/2,3)  ,  (n- 1 )  ,  (n)  . 


In  the  cas^  of  the  fricatives  for  language  "A"  the 
content  (from  computer  output  1)  was: 


E(X)  = 


- 


<r(x)  =  _ J1/2 


f  f  x)  =  1  exp 

Tfx r~ 


{-  [^p-] 


X  =  Random  Variable 
-  frequency  of 

sound  location  occurrence  of  F 


ITTUB- 

00.98 

17.46 

15.46 
01.35 


E(X) 


The  computed  mean  is: 

=  33.43  +  0  .98  +  17.46  + 

- s - 


15.46  ♦  1.35 


=  13.75 


Similarily  the  standard  deviation  was  calculated  to  be: 
v(x)  =  10.22 
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Then  using  table  6.1  [Ref.  17]  the  Gaussian  density  (pdf) 
for  fricatives  was  established  for  language  A.  Using 
identical  procedures  the  fricative  densities  were  created 
for  the  remaining  four  languages  and  superimposed  on  the 
graph  for  language  A.  The  same  procedure  was  followed  to 
establish  graphs  in  each  category.  The  tables  and  graphs 
for  this  procedure  are  contained  in  appendix  D.  The  results 
were  clearly  not  what  was  expected.  In  fact.  Shannon's  case 
(2)  is  approached  in  each  graph.  Even  in  the  graphs  where 
case  (3)  was  there,  in  spirit,  the  error  factor  is  clearly 
greater  than  the  Shannon  theorem  could  accept.  It  was 
speculated  that  increasing  the  reference  data  would  not  be 
sufficient  to  obtain  more  dispersion  in  the  graphs. 
Therefore,  this  technique  was  Abandoned. 

E.  TWO-SOUND  PROBABILITIES 

The  computer  program  which  develops  the  statistics 
creates  them  such  that  the  frequency  information  follows  the 
tree  branch  technique,  as  shown  in  figure  4. 

The  frequencies  registered  at  each  point  in  the  tree  is 
an  exact  probability.  Therefore  at  every  sound  level  for 
each  language  the  probability  per  language  is  available. 
These  probabilities  are  grouped  by  language  in  computer 
output  1,  and  are  summarized  in  table  D-I,  in  appendix  D. 

i 
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Figure  4.  The  Frequency  Tree. 


Assuming  that  all  languages  are  eguiprobable 
[P  (A)  =P  (B)  =P  (C)  =P  (D)  =P(E)  =1/5  ],  liklihood  ratios  can  be 
developed  using  Eayes  rule;  which  computes  the  probability 
of  a  given  language  based  on  the  categorical  sound  occurring 
at  a  specified  position  in  the  word.  For  example  the 
probability  of  a  test  word  being  from  language  "X”  given 
that  a  sound  category  was  identified  in  position  two  (second 
sound)  would  be  computed  as  follows; 


P(XU2) 


where;  X  =  language  =  A,  B,  C,  D,  E 

Y2=  sound  category  =  F,  N,  S,  7 

i  =  position  indicator  =  1,  2,  3,  4,  n- 1 ,  n 
j  =  position  indicator  =  1,  2,  3,  4,  n- 1 ,  n 


These  probabilities  are  tabulated  in  table  IV-I.  A  decision 
can  be  achieved  using  the  maximum  likelihood.  For  example, 
from  table  IV-I,  for  an  "S”  in  the  2nd  position  one  would 
choose  language  E  because  it  clearly  has  the  the  greatest 
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probability  of  being  the  correct  choice.  In  the  case  where 
more  than  one  word  from  the  same  language  is  being  analyzed 
the  decision  criteria  reguires  an  additional  iteration;  this 
presumes  the  words  have  different  sound  content  arrangements 
(order) .  When  the  position  is  selected  (say  at  the  second 
sound  position)  the  rows  are  removed  in  tact.  The  removed 
rows  are  then  isolated  and  a  product  is  calculated  for  the 
column  elements.  The  correct  choice  is  then  the  columnar 
product  which  is  largest/  i.e.,  has  the  greatest  likelihood. 
Multiplication  corresponds  to  the  assumption  that  words  are 
chosen  independently  at  random  from  the  language.  For 
example,  given  a  first  word  has  a  "V"  in  the  last  position 
the  maximum  likelihood  would  elect  language  B,  however,  if 


the  second  word  had  an 

product  is  taken, 

likelihood. 

"S"  in 

language 

the  last 

D  will 

position 

have  the 

and  the 

maximum 

V  .074 

.313 

.229 

.  126 

.  258 

S  .220 

.000 

.137 

.491 

.  153 

product  .016 

.000 

.031 

.062 

.  039 

sum  .294 

.313 

.466 

.617 

.41 1 

Using  the  product  method,  clearly  eliminates  any 
language  which  is  devoid  of  a  unique  sound.  The  advantage 
of  the  summing  method  is  that  languages  which  have  high 
percentage  of  occurrence  are  brought  out  for  consideration 
in  spite  of  a  categorical  void.  Arguments  can  be 
established  on  which  is  the  better  method  (sum  or  product) , 
however,  it  was  decided  to  use  the  product  method  in 
subsequent  decisions.  The  product  of  one  or  more  column 
elements  is  referred  to  as  "validity  factors"  in  computer 
program  2;  a  product  of  validity  factors  are  called 
"validity  products." 
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In  spite  of  the  closing  statement  in  section  IV-C,  that 
differentiation  was  not  feasible  on  the  basis  of  a  single 
sound,  testing  was  performed  on  some  200  random 
words/phrases.  The  results  (not  included)  were  ambiguous 
and  supported  the  earlier  claim. 


Bayes  Rule  still  offered  more  potential  than  previous 
efforts,  therefore,  the  two-sound  combinations  [of  which 
there  are  16;  F,  N#  S,  V  =>  (4) 2  ],  were  computed  in  the 

following  manner; 


P(X|  Y2Y,  ) 


where; 


— ruffian' 


X-A.E 


X  =  language  =  A,B,C,D,E 

Y  =  sound  category  =  F,N,S,V 

P(X)  =  P(A)=P(B)=P(C)=P(D)=P(E), 
by  assumption 

These  values  are  given  in  Table  IV-II. 


(6) 


The  decision  rule  is  identical  to  that  for  one-sound, 
i.e.,  maximum  likelihood.  Given  any  word(s)  greater  than 
one  sound,  a  decision  can  be  reached.  Again  two  or  more 
words  dictate  the  product  (or  sum)  rule  to  obtain  the 
maximum  likelihood. 


The  results  of  limited  testing  (30  words)  using  this 
method  were  below  the  desirable  limit.  In  the  case  of  one 
random  word,  decisions  were  less  than  20%  correct;  for  two 
words  less  than  50%  correct;  and  three  word  decisions  ranged 
from  45-55%  correct.  Though  the  correct  decisions  were 
proportional  to  the  number  of  words,  the  decision  procedure 
appeared  to  be  weak.  Presuming  this  was  a  result  of  the 
earlier  assumption  that  all  languages  were  eguiprobable,  a 
procedure  similiar  to  Bayes  was  constructed  and  defined  as 
follows; 
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(7) 


X-A.E 

where:  X  =  A,B,C,D,E 

Y  =  F,N,S,V 

Again  the  assumption  of  equiprobable  languages  was  inserted 
thus  removing  the  [P(X)]2  terms.  The  remaining  equation  is 
hereby  defined  as  "Prognostic  Progression."  The  equation 
calculates  the  "Logical  Prognosis"  of  a  language  given  the 
frequency  of  occurrence  at  a  position  and  the  probability 
[or  Logical  Prognosis  (L) ]  brought  forward  from  the  previous 
position  in  the  tree  (fig.  4) .  Table  IV-III  was  calculated 
using  this  procedure,  and  is  compared  with  the  Bayes 
solutions  in  Table  IV-IV.  The  results  are  interesting. 
They  show  that  the  resultants  are  shifted,  from  one  table  to 
the  next,  and  tend  to  negate  the  assumption  of  language 
equiprobability.  Thus  the  probability  of  language  arrivals 
is  introduced,  but  indirectly,  at  any  selected  level  except 
the  first  (single  sound) .  This  enables  the  use  of  variable 
probability  for  reference  languages,  i.e.,  unknown 
reference  sample  sizes  are  inherently  accounted  for.  Three 
comparative  tests  were  run  for  Tables  IV-III  versus  IV-II 
using  100  randomly  selected  words  (20  per  language) .  The 
results  showed  for  the  two-sound  level  that  the  formulas  had 
relatively  the  same  effect.  The  overall  results  achieved 
were:  (note,  100  word  samples  increased  minimum  correct  for 

this  test  to  40%  compared  to  previous  20%) 


%  CORRECT 


Procedure 


test  1 


test  2  test  3 


BAYES 


39 


40 


38 

40 


COOKS 


4  1 


39 
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Further  test  at  the  three-sound  level  were  conducted; 
results  very  similiar  to  the  two-sound  results  were 
obtained.  The  number  of  correct  choices  were  within  1 — 4% 
of  each  other,  however,  it  was  noted  that  in  4  of  7  tests 
(avg.  words  =  35)  the  magnitude  of  the  correct  over  the  next 
competitor  was  much  greater  with  Cook's  method. 
Additionally,  the  order  of  magnitude  from  the  most  likely 
(maximum  product)  to  the  second  most  likely,  second  to 
third,  etc.,  was  much  more  even,  and  smaller  in  Bayes  than 
in  Cooks.  It  was  thus  assumed  that  the  idea  of  "Prognostic 
Progression"  was  at  least  equivalent  to  Bayes  Rule.  This 
assumption  encouraged  the  writing  of  computer  program  number 
2. 


The  basic  operation  of  computer  program  2  is  a 
continuation  of  program  1,  except  that  some  extra  features 
were  added.  The  primary  extra  feature  is  variable 
partitioning,  wherein  words  of  the  same  sound  length  are 
analyzed  collectively  by  language.  The  partitioning  is 
variable  in  that  the  statistics  can  be  ended  at  any  point, 
i.e.,  all  words  >1,  create  the  same  data  as  in  computer 
output  1;  words  >2,  creates  data  for  one  sounded  words, 
apart  from  the  data  for  words  of  two  sounds  and  greater; 
this  process  is  allowed  through  words  >6.  The  decision 
portion  forces  a  word  to  follow  a  flow  path  based  on  the 
number  of  sounds  the  word  contains.  By  doing  this,  for 
example,  three  sounded  words  are  only  compared  with  all 
other  three  sounded  words  in  the  data  collection,  unless  the 
the  partitioning  is  set  at  >2  or  >3.  The  standard  test 
(comparisons)  in  the  flow  path  for  a  partitioning  of  >6,  is 
provided  in  table  IV-VII,  and  the  program  flow  diagram  is 
provided  in  figure  5.  Secondary  features  include  selectable 
graphic  representations  of  either  the  raw  statistics,  or  the 
Prognostic  Progression  values  for  any  or  all  reference  data. 
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The  model  was  tested  using  randomly  selected  words  from 
each  language  within  the  data  base.  This  "loop"  test  was 
used  to  determine  if  the  model  could  correctly  segregate  and 
identify  what  it  already  held  in  storage. 

The  first-two  position  results  related  a  noticeable 
similariity  between  the  languages,  and  required  at  least  a 
two  word  sample  size  to  achieve  correct  decisions  in  50%  of 
the  tests.  The  results  of  this  test  are  tabulated  in  table 
IV-V.  Tests  involving  the  first-three  sounds  showed  even 
greater  improvement  in  the  results.  This  indicated  that  at 
the  three-sound  level  unique  patterns  were  emerging.  The 
test  of  three-sounds  duplicated  the  procedure  used  for  the 
two-sounds  and  their  results  are  tabulated  in  table  IV-VI. 

Testing  beyond  this  point  was  considered  insignificant 
due  to  the  limited  reference  data.  Beyond  three-sounds,  the 
tests  were  approaching  100%  for  as  few  as  five  word/phrase 
test  samples.  The  consensus  was  that  even  though  patterns 
were  uniquely  developing,  their  percentage  relative  to  the 
whole  of  a  language  negated  the  significance  of  the 
patterns. 

A  final  group  of  tests  was  run  with  the  program  changed 
to  use  Bayes'  Rule,  rather  than  the  Prognostic  Progression. 
The  results  (not  included)  were  for  all  practical  purposes 
identical.  The  Bayes' Rule  method  also  forced  100%  correct 
decisions  as  early  as  five  words.  On  the  basis  of  this  test 
it  is  concluded  that  Bayes'  Rule  and  Cooks  Prognostic 
Progression  perform  equally  on  this  type  of  task. 
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Table  IV-I 


Bayes 

' Theorem 

Calculations 

for  one- 

sound 

Yi 

P  (A  |  Y;  ) 

P  (  B  |  Yj  ) 

P  (C|  Y;  ) 

P(D|  Yj  ) 

P(E|Y;  ) 

Fi 

.237 

.204 

.  142 

.179 

.235 

N, 

.  179 

.181 

.223 

.260 

.157 

s, 

.  281 

.  167 

.  185 

.  140 

.227 

V, 

.084 

.250 

.266 

.252 

.148 

F2 

.035 

.  161 

.374 

.169 

.261 

n2 

.07  0 

.350 

.234 

.  148 

.200 

s2 

.  11  1 

.029 

.185 

.269 

.405 

v2 

.27  0 

.  173 

.  169 

.236 

.153 

*3 

.230 

.129 

.224 

.254 

.162 

N3 

.284 

.  160 

.197 

.202 

.157 

S3 

.  07  8 

.300 

.208 

.130 

.383 

V3 

.174 

.215 

.  187 

.212 

.212 

r4 

.  041 

.186 

.217 

.302 

.253 

n4 

.  146 

.26  3 

.213 

.096 

.283 

S4 

.499 

.083 

.  131 

.  140 

.148 

Y4 

.  130 

.231 

.  223 

.252 

.163 

Fn-, 

.  02  2 

.396 

.  151 

.214 

.216 

.  144 

.  169 

.296 

.  154 

.237 

Sn-1 

.051 

.328 

.  162 

.179 

.280 

Vn-1 

.358 

.098 

.  178 

.233 

.134 

Fn 

.293 

.138 

.  067 

.297 

.205 

Nn 

.  396 

.  102 

.226 

.194 

.096 

Sn 

.220 

.000 

.137 

.491 

.153 

Vn 

.074 

.  313 

.229 

.126 

.258 
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Table  IV-II 


Bayes'  Theorem  Calculations 
for  two-sounds 


P  (A  |  Yj 

Yj  )  P  (B  J  Yj  Yj  ) 

P(C|  Y|  Yj  ) 

P(D|  Yj  Yj  ) 

P  {  E  |  Yj  Yj  ) 

F,  F2 

.000 

.000 

.000 

.000 

1.0 

F,  N2 

.037 

.388 

.  154 

.068 

.353 

Ft  S2 

.  282 

.000 

.000 

.137 

.581 

F,  V2 

.303 

.199 

.  188 

.190 

.120 

N,  F2 

.000 

.000 

.000 

.000 

.000 

N,  N2 

.000 

.  161 

.  400 

.000 

.438 

Hi  S2 

.000 

.000 

1.0 

.000 

.000 

N,  V2 

.  200 

.  163 

.  179 

.328 

.130 

Si  f2 

.  192 

.000 

.000 

.351 

.457 

S,  n2 

.062 

.324 

.243 

.151 

.221 

Sj  s2 

.000 

.000 

.000 

.000 

1.0 

s,  v2 

.298 

.192 

.  154 

.  14  1 

.216 

7,  F2 

.000 

.334 

.443 

.125 

.098 

V,  U2 

.  119 

.276 

.  285 

.246 

.075 

Vi  S2 

.04  4 

.052 

.403 

.081 

.420 

v,  v2 

.  188 

.  109 

.  097 

.570 

.037 
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Table  IV-III 


Cookfe  Prognostic  Progression  Calculations 
for  two-sounds 


\ 

L  (A  |  Yj  Yj 

)  MBIY,  Tj  ) 

MCI  Yj  Yj  ) 

L(D|  Ys  Yj) 

MEIYj  Yj  ) 

F,  F2 

.000 

.000 

.000 

.000 

1.0 

F,  N2 

.045 

.359 

.  114 

.06  2 

.421 

F,  S2 

.293 

.000 

.000 

.108 

.599 

F,  V2 

.363 

.  184 

.  138 

.  172 

.142 

N,  F2 

/ 

.000 

.000 

.  000 

.000 

.000 

»i  n2 

.000 

.144 

.483 

.000 

.372 

N,  S2 

.00  0 

.000 

1.0 

.000 

.000 

N,  V2 

.171 

.  130 

.  192 

.409 

.098 

Si  f2 

.253 

.000 

.000 

.231 

.516 

S,  112- 

.085 

.334 

.219 

.103 

.260 

Si  s2 

.000 

.000 

.000 

.000 

1.0 

s,  v2 

.373 

.  181 

.126 

.088 

.232 

V,  f2 

.000 

.325 

.486 

.130 

.060 

Vi  N2 

.045 

.290 

.339 

.276 

.050 

V,  S2 

.018 

.059 

.521 

.099 

.302 

V,  V2 

.073 

.  118 

.  119 

.664 

.026 

35 


Table  IV-IV 


Shifting  Effects  Denoted 
by  the  Difference  in  (V-III)  -  (V-II) 


A 

B 

C 

D 

E 

FF 

.  000 

.000 

.000 

.000 

.000 

FN 

.008 

.  029  (-) 

.040  (-) 

.006 (-) 

.068 

FS 

.011 

.000 

.000 

.029  (-) 

.018 

FV 

.060 

.015  (-) 

.050  (-) 

1 

• 

o 

00 

.022 

NF 

.000 

.000 

.000 

.000 

.000 

NN 

.000 

.017  (-) 

.083 

.000 

.066  (-) 

NS 

.000 

.000 

.000 

.000 

.000 

NV 

.029  (-) 

.033  (-) 

.  013 

.081 

.032  (-) 

SF 

.061 

.000 

-000 

.120  (-) 

.059 

SN 

.023 

.010 

-  024  (-) 

.048  (-) 

.039 

SS 

.000 

.000 

.  000 

.000 

.000 

sv 

.075 

.01  1  (-) 

.  028  (-) 

.053  (-) 

.016 

VF 

.000 

.009  (p) 

.043 

.005 

.038  (-) 

VN 

.  074  (-) 

.014 

.054 

.030 

.025  (-) 

VS 

.026  (-) 

.007 

.118 

.018 

-  1 1  3  (-> 

VV 

.009 

.022 

.  094 

.011 
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Table  IV-V 


Two-Sound  Results 
using  Prognostic  Progression 


ONE-WORD-  TEST: 


Lng 

smpl 

size 

A 

B 

C 

D 

E 

A 

33 

.637 

.121 

.000 

.  182 

.061 

B 

25 

..240 

.400 

00 

O 

• 

.  120 

.016 

C 

49 

.367 

.  204 

.  163 

.265 

.000 

D 

42 

.286 

.  143 

.000 

.  571 

.  000 

E 

60 

.317 

.050 

.150 

.150 

.333 

TWO-WORD  TEST: 

smpl 

Lng  size  A 

B 

C 

D 

E 

A  16  .563 

.  125 

.000 

.313 

.  000 

B  12  .083 

.583 

.083 

.083 

.  167 

C  24  .167 

.  292 

.2  08 

.330 

.000 

D  21  .238 

.095 

.000 

.667 

.000 

E  30  .133 

.  067 

.  133 

.167 

.  500 

Average  correct  single  word  =  83/209  =  .397 
Average  correct  two  words  =  50/90  =  .556 

Sample  sizes  of  15  and  greater  gave  results  of  100%  correct 
selection  in  4  of  5  tests. 
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Table  IV-VI 


Three-Sound  Results 
using  Prognostic  Progression 


ONE-WORD  TEST: 


smpl 


Lng 

size 

A 

B 

C 

D 

E 

A 

33 

.667 

.033 

.100 

.233 

.067 

B 

25 

.160 

.  480 

.160 

.  160 

.040 

C 

49 

.306 

.041 

.327 

.265 

.  061 

D 

42 

.214 

.071 

.04  8 

.595 

.071 

E 

60 

.217 

.083 

.  133 

.167 

.400 

TWO-WORD  TEST: 

smpl 

Lng  size  A  B 

C 

D 

E 

A  16  .688  .000 

.000 

.313 

.  000 

B  12  .083  .580 

.  167 

.  167 

.  000 

C  24  .167  .042 

.500 

.292 

.000 

D  21  .095  .048 

.048 

.810 

.000 

E  30  .100  .067 

.100 

.  167 

.567 

Average  correct  single  word  =  97/209 
Average  correct  two  words  =  64/103  = 
Sample  sizes  of  11  and  greater  gave 

=  .464 

.621 

results  of 

100%  correct 

selection  in  all  tests. 
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Table  IV-VII 


flodel  Test  Path 
for 

A  Given  Word  Length 


word 

length  Test  Criterion 


1. X  P(x) 

2.  P(x)  ,  L  (y,  y2  |x) 

3.  P(x)  ,  L  (y,  y2  |x)  ,  L  (y,  y2  y3  |x) 

4.  P(x)  ,  L  (y,  y2  |x)  ,  L  (y,  y2  y3  |x)  ,  L(y,  y2  y3  y4  |  x) 

5.  P  (x)  ,  L  (y,  y2  |  x) ,  L  (y,  y2  y3  I  x)  ,  L  ( y,  y2  y3  y4  I  x )  , 

L<y,  y2  *3*4  y5  «x> 

6.  P(x),  L(y,y2|x),  L(y1y2y3|x),  L  (y,  y2  y3  y4  |  x )  f 
My,  y2  y3y4  y5  lx)  ,  My5y6|x),  L  (y,  y2  y5  V6  |x)  , 

L<y,  y5'x) *  L(y,  y2y3y4y5y6 |x) 


where:  x 

y 

1 

2 

3 

4 

5 

6 


language  =  A,B,C,D,E 

sound  category  =  F,N,S,V 

first  sound 

second  sound 

third  sound 

fourth  sound 

last  sound  =  n 

next  to  last  sound  =  n-1 
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V.  RECOMMENDATIONS  FOR  FUTURE  STUDIES 


1.  The  model  finally  developed  has  considerable 
potential  which  was  not  exhaustively  exploited  herein.  The 
reference  data  should  be  vastly  expanded,  or  at  least 
altered  to  reflect  the  most  current  language/vocabularies  in 
use.  1000-1500  words  and/or  phrases  per  language  is 
suggested,  because  only  20%  of  that  range  was  used  here,  and 
it  appeared  that  word  uniqueness  was  overshadowing  language 
unigueness. 

2.  The  Prognostic  Progression  formula  (no.  7)  should  be 
tested  at  greater  lengths  to  identify  its  uniqueness,  or 
qualify  it  as  an  extension  of  Bayes  Theorem. 

3.  The  largest  problem  encountered  was  in  not  having  a 
spoken  speech  to  phoneme  converter.  Assuming  a  similiar 
lack  of  availability  for  future  researchers,  it  is 
recommended  that  high  quality  transcriptions  be  obtained 
earlyon,  and  the  quantity  of  transcripted  words  be  used  as  a 
criterion  for  continuance.  Further  phrase  transcriptions 
will  provide  better  results  than  word  lists. 

4.  Diacritical  marks  and  emphasis  were  not  considered 
herein,  but  may  prove  beneficial  in  further  language 
differentiation  developments. 
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VI. 


CONCLUSION 


The  following  are  presented  in  relation  to  the  findings 
herein: 

1.  The  IPA  does  appear  to  be  an  able  vehicle  to  support 
a  language  differentiation  scheme.  However,  the  ROMI 
conversion  alphabet  should  have  been  expanded  to  reflect 
diacritical  marks  across  the  languages  used. 

2.  Single  sounds  are  definitely  an  inadequate 
differentiation  approach.  Two-sound  combinations  present 
only  borderline  results.  At  the  three-sound  point  definite 
uniqueness  appears,  and  is  improved  with  each  additional 
sound  brought  into  the  combinations.  The  ending  sounds 
(backdoor  approach)  begin  to  present  patterns  of  interest  at 
the  two-sound  point,  and  should  have  been  explored  at 
greater  length.  For  the  languages  used  herein  (all 
European)  it  is  generally  concluded  that  patterns  emerged 
which  were  unique  in  spite  of  the  limited  sample  size. 

3.  The  final  analysis  and  decision  model  (computer 
program  2)  is  considered  a  successful  tool  for  language 
differentiation.  It  appears  to  have  considerable  potential 
for  expanded  analysis. 

4.  Perhaps  the  most  important,  though  unexpected,  result 
was  obtained  via  the  "loopback”  testing  of  the  model.  This 
test  pointed  out  the  model  was  capable  of  pattern  matching 
decisions  of  high  reliability  with  small  test  samples(<5). 
This  certainly  presents  an  improvement  to  the  technique  of 
word  matching,  in  that  storage  requirements  and  accessing 
times  are  greatly  reduced.  This  potential  is  limited  only 
by  the  ability  to  update  the  data  base  relative  to  current 
desired  operability. 
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APPENDIX  A 


THE  SPEECH  SYSTEM 

Hhen  discussing  the  speech  system,  it  is  quite  possible 
to  delve  into  many  scientific  fields  of  endeavour  in  order 
to  fully  exploit  the  interaction  and  functions  of  the 
components.  In  an  overview  the  players  in  a  system  diagram 
analysis  (figure  1)  are  the  transmitting  section  and  the 
receiving  section. 


TRANSMITTER 


RECEIVER 


Figure  A-1.  Block  Diagram  of  the  Speech  System 
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The  receiving  section  is  comprised  of  the 
organ,  the  brain,  and  the  responders.  A  schematic 
hearing  system  (figure  2)  shows  a  subdivision  into 
outer  ear,  (2)  the  middle  ear,  and  (3)  the  inner  ear. 


» 


Fic.  75. — SemidiaCrammatic  Section  through  the  Right  Ear  (Czermah):  G,  external 
auditory  meatus;  T,  membrana  tympani;  P,  tympanic  cavity;  o,  fenestra  ovalis; 

R,  FENESTRA  ROTUNDA;  B,  SEMICIRCULAR  CANAL;  S,  COCHLEA;  Vt,  SCALA  VEST1BULI; 
Pt,  SCALA  TYMPANi;  E,  EUSTACHIAN  TUBE;  R,  PINNA. 


Figure  A-2.  The  Hearing  System 


hearing 
of  the 
(1)  the 
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The  outer  ear  functions  as  an  antenna,  and  both  ears  used 
together  comprise  a  directional  antenna  system  i.e.,  capable 
of  determining  the  direction  of  the  sound  source  relative  to 
the  listener.  The  outer  ear  also  contains  the  feed  system 
to  the  receiver.  The  middle  and  inner  ear  comprise  the 
receiver  itself.  The  middle  ear  senses  the  presence  of 
sound  and  passes  the  mechanical  energy  to  the  inner  ear, 
where  it  is  converted  to  electrical  impulses  modulated  in 
both  amplitude  and  frequency.  These  impulses  are  then  sent 
to  the  brain  which  detects  the  intelligence,  and  activates 
decision  circuits  to  provide  physical  response  as  necessary. 
The  Eustachiam  tube  (E  in  figure  2)  '  insures  that  there  is 
equal  pressure  on  both-  sides  of  the  membrana  tympani 
(eardrum) .  The  Eustachian  tubes  also  provide  some 
side-tonal  pressures  whenever  speech  or  sound  is  generated 
at  its  input  (upper  part  of  the  throat  behind  the  nasal 
cavity) .  This  side-tonal  effect  appears  to  provide  a 
negative  feedback  to  ’'pad”  or  reduce  the  effects  of  the 
amplitude  of  sound  waves  emitted  when  they  arrive  at  the 
receiver  input  of  the  "transceiver"  system.  An  analogy  to 
the  "TR/ATR"  action  in  a  radar  is  certainly  suggested  by  the 
side-tone  effect.  The  functions  of  the  receiving"  mechanism 
are  those  for  which  a  mechanized  language  differentiation 
model  is  desired.  The  available  technoiogy  does  not  dictate 
a  direct  approach  to  the  solution,  however,  the  basics  are 
clear  in  the  "blackbox"  approach.  The  output  is  a  response 
resulting  from  a  decision  based  on  the  input  received.  The 
type  of  decision  device,  must  then  be  determined  from  what 
information  can  be  gleaned  from  the  input.  The  input  is 
assumed  to  be  an  exact  duplication  of  the  output  from  the 
transmitter  section.  Therefore,  if  it  can  be  determined  how 
the  transmitter  functions  based  on  observable  output,  it 
seems  feasible  that  some  model  could  be  developed  to 
interpret  the  information  the  transmitter  would  'send 
subsequently. 
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The  transmitting  section  is  of  much  greater  interest 
and  value  to  this  research.  Basically,  as  seen  in  figure  1, 
the  brain  sends  responder  messages  to  the  lungs  and  vocal 
shaping  components.  The  lungs  provide  the  mechanical  energy 
which  is  shaped,  amplitude  and  frequency  modulated,  and 
emitted  as  speech  or  vocalization.  Thus  the  transmitter  is 
the  human  vocal  system.  A.  block  diagram  of  the  system 
(figure  3)  gives  the  breakdown  of  the  vocal  system  into  four 
subdivisions.  The  schematic  diagram  (figure  4)  gives  the 
anatomical  layout  of  the  components. 


fig.  A-3.a.  The  "Black-box"  Approach 


fig.  A-3.b.  The  "Black-box"  Expansion 

Figure  A-3.  Human  Vocal  System 
(block  diagram) 
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Palate 


W-VV-Hofd - ^-Soft- 


Nasal  passage" 


Alveola 
ridge 

Central 
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I  | 
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Pharynx 


Tip—/ 

Tongue 

Sack 
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Thyroid  cartiloge 


•Trachea 


Figure  A-4.  Human  Vocal  System 
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The  lungs  act  as  a  power  supply,  collecting  air  such 
that  it  can  be  expelled  with  the  required  volume  and 
velocity  whenever  dictated  by  the  brain.  Both  the  voluntary 
and  involuntary  actions  by  the  lungs  are  used  in  the  speech 
process.  Voluntary  action  is  very  apparent  in  the  ''raised*' 
voice  or  "yelling"  process,  where  there  is  an  obvious 
increase  of  pressure  on  the  lungs  forcing  the  volume  and 
velocity  out  of  the  lungs  to  be  increased. 

The  oscillator  and  resonant  cavities  consist  of  the 
glottis  (vocal  chords)  ,  the  pharyngeal  passage  (upper 
throat)  ,  the  oral  cavity  (the  mouth) ,  and  the  nasal  passage. 
The  glottis,  when  excited  by  an  expulsion  of  air,  vibrates 
or  oscillates  at  a  fundamental  frequency  (male  average  is 
130Hz,  female  average  is  205Hz)  [Hef.  18],  and  generates 
many  harmonics.  The  frequency  actually  emitted  may  be  the 
fundamental,  a  harmonic,  or  some  combination.  In  general, 
when  sounds  are  emitted  there  is  an  amplification  of  some 
parts  of  the  spectrum  and  suppression  of  others,  much  like 
the  characteristics  of  a  bandpass  filter,  i.e.,  selective 
transmission.  The  fundamental  frequency  is  a  result  of  a 
number  of  factors,  namely  the  size,  shape,  and  elasticity  of 
the  glottis,  the  circumference,  length,  and  shape  of  the 
trachea,  and  the  size  shape,  and  depth  of  the  oral  and  nasal 
cavities;  specifics  for  typical  dimensions  and  frequencies 
relative  to  these  paramenters  can  be  found  in  appendix  B. 
In  the  aggregate  these  factors  constitute  a  tuned  waveguide 
and/or  cavities  which  tend  to  alter  the  basic  frequency  of 
the  glottis  vibrations.  In  support  of  this,  consider  what 
occurs  when  the  velum  is  opened  (by  its  trapdoor  action)  to 
allow  air  to  flow  through  the  nasal  cavity;  the  volume  of 
the  system  is  increased  which  is  consistant  with  a  lower 
frequency  of  resonance  normally  attributed  to  the  nasalized 
tones. 
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The  pulse  shaper  provides  the  ultimate  control  of  power 
during  constant  output.  The  shapers  consist  of  the  tongue, 
teeth,  lips,  jaw,  and  palate.  The  shapers  affect  both  the 
frequency  and  amplitude  of  the  transmitted  waves  ind  thus 
act  very  similiar  to  a  modulator  section  in  a  radio 
transmitter.  Frequency  variability  occurs  in  the  range 
available  from  the  fully  open  to  the  fully  closed  mouth,  and 
has  numerous  descrete  frequencies  in  between  these  extremes. 

The  antenna  or  radiating  system  consists  of  the  actual 
oral  and  nasal  orifices.  The  lips  and  jaw  both  work 
independently  and  together  in  the  shaping,  frequency, 
amplitude,  and  emission  operations.  The  parameters  of 
importance  in  the  output  are  again  frequency  and  amplitude. 
The  sound  wave  emitted  will  be  of  higher  frequency  when  the 
mouth  is  more  in  the  closed  position  than  open.  The  affects 
of  the  nasal  chamber  is  controlled  by  the  velum,  which 
determines  the  amount  of  air  (by  volume)  allowed  to  pass  in 
the  cavity.  The  nasal-oral  combinations  regulate  the  output 
frequency  by  combining  the  orifice  controls  of  the  velum  and 
the  tongue  position.  These  combination  sounds  occur  more 
frequently  in  languages  other  than  English. 


RECOGNITION  OF  SOUNDS 

Every  emission  via  the  vocal  system  falls  into 
categories  which  have  been  previously  developed  and 
quantized  in  varying  degrees.  Numerous  experiments  have 
been  conducted  particularly  in  the  speaker  recognition  area 
where  the  uniqueness  within  these  categories  are  further 
quantized  to  search  out  factors  which  are  undeniably  speaker 
dependent,  and  thus  provide  keys  to  recognition  algorithms. 
Doherty  [Ref.  19]  used  25  significant  factors  in  his  Long 
Term  Spectra  (LTS)  analysis  to  achieve  100%  recognition  (a 
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rarity  in  recognition  results)  for  a  sample  size  of  50. 
Doherty  did  not  provide  a  specific  breakdown  of  the  25 
factors  he  used,  however,  he  did  indicate  that  various 
combinations  of  frequency,  times,  nazalization ,  pauses, 
stops,  and  fricatives  were  some  of  the  factors  used.  Others 
such  as  Young  [Ref.  20]  have  used  a  single  category  of  sound 
to  promote  recognition  schemes.  Young  looked  specifically 
at  nazalization  and  achieved  results  from  50%  to  97%  with  a 
sample  size  of  13.  The  categories  are  relative  then  to 
recognition  of  many  entities-  in  speech.  Word/phrase 
boundaries  as  well  as  the  individual  components  and  sound 
patterns  can  be  identified  and  removed  for  analysis. 
Experimenters  make  every  effort  to  eliminate  odds  whenever 
possible,  therefore,  it  is  not  uncommon  to  find  that  when  a 
project  is  dedicated  to  a  specific  area  of  sound  to  research 
that  test  segments  are  carefully  screened  to  be 
representative  of  the  specific  area.  Sounds  can  be 
classified  by  a  number  of  parameters  (time,  frequency,  form, 
etc.)  One  such  classification  technique  categorizes  sounds 
by  form,  i.e.,  by  the  way  they  are  generated. 


The  categories  of  concern  are:  the  vowels  (v) ,  the 
stops  (s)  ,  the  nasals  (n)  ,  the  fricatives  (f)  ,  and  the 
pauses  (p) . 

The  vowel  sounds  are  that  group  which  flow  smoothly 
(uninterupted)  from  the  lungs  through  the  oral  orifice 
utilizing  the  whole  of  the  vocal  cavity,  with  minor 
adjustments  in  resonance  promoted  by  the  specific  location 
of  the  tongue.  Vowels  are  further  defined  according  to  the 
tongue  position  with  respect  to  the  oral  cavity  (mouth)  in 
general.  The  front  vowels  are  generated  whenever  the  tongue 
position  toward  the  front  of  the  mouth  is  the  major  factor; 
the  sounds  specific  to  this  location  are:  i(HF),  I  (LHF)  , 

e(HMF),  £(LMF),  ®(LF).  The  back  vowels  are:  u(HB),  0  (LHB)  , 
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0  (M  B)  /  o(HLB)  ,  a  (LB)  .  The  central  vowels  are:  3*(ac)  ,  ^(MC)  , 
3  (MC)  ,  a(MC)  ,  a  (LHC)  .  In  addition  to  the  distinctive  vowels 
noted  above  are  a  group  of  connected  vowels  whose  separation 
are  said  to  alter  the  meaning,  therefore  they  are  taken 
collectively  and  have  the  special  sub-categorization  name 
"diphthongs.”  The  diphthongs  are  al,  aU,  oi,  and  ju. 
Diphthongs  are  also  referred  to  as  the  vowel-glides  since 
there  is  an  alteration  in  the  resonance  which  resemble  the 
glide  consonants. 

The  stop  category  consists  of  those  sounds  which  are 
emitted  as  a  ' puf f-of-air ,  i.e.,  no  energy  is  followed  by  a 
large  spike  cf  energy.  Stops  are  also  known  as  plosives  due 
to  the  energy  spike,  but  stop  is  more  commonly  used  since 
the  speaker  must  actually  establish  a  stop  in  the  smooth 
expulsion  of  sounds  to  generate  the  plosive  burst.  The  stop 
category  has  the  components:  p,  b,  t,  k,  and  g.  A 
sub-category  of  the  stops  is  the  affricates  tj  and  dj  which 
are  again  combinations  which  cannot  be  reduced  without 
changing  the  meaning,  therefore,  they  are  classified  as 
unique  and  essential.  t 

The  third  group  is  known  as  the  fricatives.  These 
sounds  require  a  velocity  change  which  is  achieved  by 
creating  a  narrow  slit  between  the  upper  teeth  and  the  lower 
lip,  the  upper  teeth  and  the  tongue,  or  the  upper  teeth  with 
the  lower  teeth.  The  objective  is  to  create  a  higher 
frequency  sound  at  a  higher  velocity.  The  components  of 
this  category  are:  f,  v,  d,  3,  s,  z,  J*,  J,  and  h.  At 
present  there  are  no  apparent  sub-categories  which  are 
considered  unique. 

The  nasal  category  actually  consists  of  glides,  nasals, 
and  laterals.  The  glide  is  so  called  because  of 
articulation  organ  movement  required  in  its  generation.  For 
example,  note  that  when  the  single  sound  of  "r"  is  generated 
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the  oral  cavity  enlarges,  specifically  note  that  the  tongue 
drops  in  position.  Enlarging  the  vocal  cavity  tends  to 
lower  the  frequency  of  resonance,  such  that  the  sound  is 
readily  identifiable  as  a  smooth  transition  from  a  high  to  a 
lower  frequency.  The  nasal  sounds  are  those  sounds  which 
are  emitted  via  the  nasal  cavity  only,  that  is  the  mouth  is 
blocked  by  a  tight  closing  of  the  lips  for  the  sound  of  "m,” 
and  by  the  tongue  pressed  firmly  against  the  hard  palate  for 
the  generation  of  "n."  The  final  component  of  this 
sub-category  is  generated  when  the  mouth  is  blocked  by  the 
center  and  back  of  the  tongue  to  form  the  sound  associated 
with  -g  .  The  lateral  has  a  single  component  "l.”  The 
lateral  utilizes  both  the  nasal  and  oral  cavities  for  the 
generation  of  this  glide-like  sound. 

The  final  category  is  that  of  pauses.  A  pause  is  an 
obvious  lack  of  sound  of  sufficient  duration  so  as  to 
provide  a  well-defined  boundary  for  a  word  or  phrase  as 
appropriate.  Pauses  really  represent  a  special  situation 
that  requires  a  more  indepth  discussion. 

The  pause  is  that  period  in  speaking  which  is  denoted 
by  the  total  absence  of  sound.  In  writing  one  provides  word 
separations  cr  boundaries  by  the  absence  of  script  or  by 
leaving  an  intentional  space.  Unfortunately  this  does  not 
follow  in  speaking.  In  speech,  words  'are  run  together  such 
that  a  combination  of  words  (hereforth  defined  as  a  phrase) 
would  appear  as  a  single  utterance  which  is  devoid  of 
pauses.  For  example,  the  phrase  "the  brown  cow”  is  spoken 
as  "thebr cwncow. "  In  a  transcription  from  the  text  the 
phrase  becomes  ”cfe  braun  kau,”  but  the  spoken  appears  as 
"cJabraunk  u."  This  draws  forth  the  problem  of  sound  pattern 
significance  for  the  two  cases  described  as  (1)  with  and  (2) 
without  pauses. 

It  appears  as  though  a  pause  may  or  may  not  be  present 
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with  equal  probability.  In  the  case  of  English  it  is 
observed  that  three  distinct  areas  enter  into  the  study  of 
pause  insertion.  Speaking  generally  for  the  U.S.A.,  the 
Northeast  coast  population  speak  very  rapidly  and  pauses  are 
difficult  to  detect;  Southerners  speak  very  slowly,  so  that 
pauses  are  distinct  and  occur  more  often; 

Resterners/mid-westerners  fall  somewhere  in  between, 
representing  a  median  for  the  extremes.  The  actual 
detection  of  word  boundaries  via  the  spectrograph  or 
similiar  process  has  not  been  achieved  with  any  degree  of 
accuracy,  and  is  particularly  poor  with  an  unknown  sample 
[Ref.  21].  Therefore  the  problem  must  be  looked  at  from  two 
points  of  view.  The  first  looks  at  the  sound  patterns  based 
on  individual  word  list  inputs.  The  premise  here  is  that 
the  probability  of  any  word  beginning  a  phrase  could  be 

high.  From  the  words  of  the  phrase  "the  brown  cow"  one 
would  certainly  expect  the  article  "the"  to  follow  a  pause 
with  a  high  degree  of  certainty.  The  word  "brown"  could  be 
.  given  a  probability  assignment  of  .5  (equiprobable  that  it 
would  or  would  not  begin  a  phrase  with  some  certainty)  .  The 
word  "cow"  would  have  a  much  lower  degree  of  certainty 
associated  with  it,  but  greater  than  zero,  since  words  such 
as  "cowboy,"  etc.  exist.  A  second  argument  would  be  that 
only  phrases  (devoid  of  pauses)  exist  and  they  therefore, 
should  be  the  only  sound  strings  used  in  the  analysis. 

The  solution  to  these  points  as  far  as  this  thesis  was 

concerned  was  to  look  at  both  words  and  phrases, 

individually  and  collectively.  The  rationale  being  that 
since  the  analysis  only  looks  at  the  first  four  and  last  two 
of  the  scund  string,  and  since  there  is  sufficient 
probability  that  any  given  word  may  be  the  first  word  in  a 
given  string  (phrase)  ,  that  the  characteristics  of  that 
individual  word  are  very  germane  to  the  sound  string 
analysis.  The  use  of  phrase  analysis,  recognizes  that  pause 
detection  may  be  very  applicable  in  the  conversion  of  spoken 
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sounds  to  a  computer- acceptable  input. 


A  series  of  sound  emissions  can  be  observed  either  in 
whole  or  in  part  wherein  individual  sounds  can  be  sensed 
categorically  or  uniquely,  dependent  upon  the  type  of 
analysis  which  is  desired.  In  the  analysis  by  Young  [Ref. 
20]  he  was  able  to  concentrate  on  the  nasal  phonetation  by 
first  carefully  selecting  words  (including  nonsense  words) 
which  clearly  contained  significant  nasal  sounds,  then  via  a 
masking  or  filtering  technique  he  removed  those  quantities 
which  were  not  of  concern  to  him.  Others  have  used  a 
variety  of  masking  techniques  to  eliminate  all  but  the 
desired  elements  of  the  test  speech. 

The  concern  herein  was  not  to  extract  any  sounds  or 
rely  on  the  uniqueness  of  sounds  for  the  purpose  of 
recognition.  Rather  the  intent  was  to  establish  whether  or 
not  sound  patterns  are  unique  to  languages,  on  either  an 
individual  or  a  categorical  basis.  In  order  to  achieve  this 
end  it  is  necessary  that  sounds  can  be  uniquely  defined 
categorically. 


ELECTRONIC  PERCEPTION  OF  SOOND 

Many  attempts  to  fully  comprehend  the  operation  of  the 
vocal  system  have  been  made  through  thd  ages.  A  large 
number  of  models  were  constructed  across  time,  many  were 
unsuccessful,  while  others  proved  major  contributions.  In 
the  late  1700's  three  mechanical  models  were  independently 
produced  by  Kratzenstein,  VonKempelen,  and  Wheatstone,  each 
of  which  used  bellows,  leather  resonators,  and  reeds  to 
create  fricative,  plosive,  and  vowel  sounds  [Ref.  18].  The 
first  real  electrical  model  of  any  value  was  the  "V0DER,,, 
(Voice  Operation  DEmonstratoR)  ,  developed  by  H.  Dudley  in 
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the  1930's,  (Dudley  also  developed  the  first  VOCODER:  voice 
CODER;  also  in  the  1930's).  The  VODER  consisted  of 
oscillators,  resonance  chambers  and  a  keyboard  to  control 
the  frequency,  amplitude,  and  duration  of  each  sound  in  the 
formulation  of  words  at  its  speaker  output.  The  VODER  had  a 
vocabulary  of  2500  words  [Ref.  22,  pp.  16,  32,  49,  191]. 

The  spectrograph  is  an  analysis  tool  which  came  into 
being  in  the  1940's  as  a  result  of  studies  by  Koenig,  Dunn, 
and  Lacy  [Refs.  3  and  23].  There  are  various  types  now 
being  used,  however  they  have  common  operations  in  that  all 
take  in  speech  sounds,  filter  separate  the  sound  into  bands 
(bandpass  filtering)  and  display  the  results  on  a  frequency 
(vertical)  versus  time  (horizontal)  basis.  The  spectrograph 
uses  basically  a  Fourier  analysis  processing  [Ref.  3],  One 
motivating  factor  behind  the  development  of  the  spectrograph 
was  the  desire  to  have  a  visible  speech  mechanism  to  aid  the 
learning  process  for  deaf  persons.  Many  side  uses  were 
subsequently  developed  for  the  spectrograph.  Peterson  and 
Barney  used  the  spectrograph  technique  to  confirm  that  most 
English  sounds  occur  in  the  standard  telephone  bandwidth  of 
300-3300HZ  [Ref.  18,  p.  154].  Spectrograms  have  been  used 
as  a  means  to  develop  speaker  identification  through 
" voiceprints.  '•  Voiceprints  are  unfortunately  not  at  a  state 
where  100%  accuracy  is  consistant  with  experimental  results. 
Therefore,  voiceprints  promote  (presently)  a  shadow  of  doubt 
in  their  reliability  and  are  not  accepted  as  a  legal  tool. 
The  voiceprint  area  is  one  of  considerable  interest  to  the 
speaker  recognition  subgroup  of  the  Acoustic  Society  of 
America,  and  was  the  subject  of  attention  during  a  special 
open  discussion  during  the  90th  meeting  in  San  Francisco 
[Ref.  9]. 
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APPENDIX  3 


Some  Characteristics  of  Vocal  Tract  Transmission 


FORMANT  =  manifestation  of  the  normal  modes  as  spectral 
peaks  in  the  output  sound,  which  are  derived  from 
the  transmission  function  relating  the  mouth  and 
glottal  volume  currents.  The  transmission 

function  is:  ( 

ai  jsinhal  cosh^a+j/3)l  cos(g-) 
where  =  w/c  and  w  =  (2n  +  1)y-y- 

Formant  Parameters: 

La  =  x  =  acoustic  inertance/unit  length 

Ca  =  ~  accoustic  compliance/unit  length 

Ra  =  ' -  acoustic  resistance/unit  length 

Ga  -  — '  =  acoustic  conductance/unit  length 

A  =  tube  area  =  5cm2  =  0.775  in2 

r  =  radius  of  tube  =  1.26cm  =  0.497in 

S  =  tube  circumference  =  7.  9265cm  =  3.  12in 

p  =  air  density  =  1.14  x  10  "3gm/cm2, 

(moist  air  at  body  temperature  =  37  C) 

C  =  sound  velocity  =  8.5  x  104cm/sec, 

(moist  air  at  body  temperature  =  37  C) 

4  =  viscosity  coefficient  =  1.86  x  10  ♦  dyne-sec/cm2, 

(20  C,  0.76  in.Hg.) 

>  =  coefficient  of  heat  conduction 
=  0.055  x  10  3cal/cm-sec-deg  (0  C) 

7  =  adiabatic  constant  =  1.4 

cp  =  specific  heat  of  air  at  constant  pressure 
=  0.24  cal/gm-deg  (0  C,  1  atmos.) 

1  =  tube  length  =  17cm  =  6.693in  (for  avg.  male) 


f 


a  (11141) 
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a 


£a. 

2 


=  ♦ 

The  formant  frequencies 

w  =  (2n  +  1)  (-f-)  (-S-) 

=  ( 2n  +  1) (3230/sec) , 

n=0, 1,2, . . . 

v  =  3230  radians 
w  =  9690  radians 
v  =  16160  radians 
»  =  29070  radians 


as  computed  are: 

f  =  (2n  +  1)  (--) 

=  (2n  +  1)  (514.4)  , 
n=0, 1 , 2, .  .  . 

f  =  514.4  Hz 
f  =  1543.3  Hz 
f  =  2571.9  Hz 
f  =  4630  Hz 
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APPENDIX  C 


TRANSCRIPTION  SYMBOLS 
30MI  to  IPA  Conversion  Tables 

Table  C-I.  Fricative  Conversion 


ROMI 


IPA 


F 

V 

Q 

T-H 

S 

Z 

S-S 

z-z 

H 

Q-A 

B-B 

C-H 

S-H 

Z-H 

X 

G-F 

X-X 

R-B 

H-R 

D-R 

Z-C 

J-A 

G-B 


f 

v 

9 

3 

s 

Z 

J 

I 

h 

§ 

0 

C 

s 

z 

X 

g 

X 

6 

* 


j 

y 
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Table  C-II.  Nasal  Conversion 


ROHI 


IPA 


M 

N-N 

N-J 

L 

W 

H-W 

J 

R-R 

R 

M-J 

J-N 

N-Q 

N 

A-N 

K-Z 

Y-0 

L-L 

R-T 


m 

n 

1 

V 

hw 

j 

r 

R 

P 

N 

4- 

h 

l 

f 
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Table  C--III .  Stop  Conversions 


ROMI 


IPA 


P  P 

T  t 

B  b 

D  d 

K  K 

G  g 

C-C  c 

F-F  1 

T_T  t 

D-D  d 

Q-Q 
G-G 
N-D 


AFFRICATIVES 


q 

G 

? 


Table  C-IV.  Vowel  Conversion 


ROMI 


IPA 


I-I 

I 

E-E 

E 

A-E 

A 

C 

0 

0 

0-0 

v-v 

0-T 

U-H 

E-R 

A-A 

A-B 

3-R 

Y-Y 

Y 

0-0 

0-E 

G-A 

W-W 

I-T 


i 

i,  i 

e 

£ 

32 

a 

C 

0 

0 

u 

A 

3 

9r  (&) 

3* 

a 

3 

7 

Y 

<t> 

oe 

Ui 

i 

DIPHTHONGS 


H 

1 

al 

H 

» 

U 

I 

J-0 

ju 

> 

1 

a 

aO 

o 

i 

a 

oU 
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Table  OIV.  (cont. )  : 

1251 

C-E 

E-Q 

A-R 

I-R 

D-S 

E-S 

1-0 

U-Q 

C-Q 

E-I 


DIPHTHONGS  IPA 

03* 

£ 

CL* 

i 

0 

I 

0 

03 

el 
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APPENDIX  D 


The  data  and  graphs  contained  in  this  appendix  were 

used  in  an  attempt  to  achieve  a  decision  model  to 

differentiate  languages.  In  each  language  the  data  reading 

across  the  sound  rows  are  normalized  to  100;  the  columns 
•< 

were  not  normalized,  however,  the  columnar  data  were  used  to 
compute  the  mean  E  (x)  ,  and  the  standard  deviation  <r(x)  .  The 
graphs  were  generated  in  accordance  with  formula  (4). 


Table  D-I.  Combined  Vector  Data 


Lng 

Sounds 

F 

N 

S 

V 

#  wds 

A 

1 

33.  48 

16.29 

39.82 

10.41 

221 

A 

1-2 

0.98 

7.  84 

5.88 

85.29 

204 

A 

1-2-3 

17.  46 

45.24 

7.14 

30.  16 

126 

A 

1-2- 3-4 

— 

A 

n 

15.  46 

54.63 

11.34 

18.56 

97 

A 

n- 1 

1.35 

17.57 

5.41 

75.68 

74 

A 

E(x) 

13.75 

28.31 

13.92 

44.02 

A 

<H*) 

10.  22 

18.21 

13.  12 

30.58 

B 

1 

28.87 

16.49 

23.  71 

30.93 

97 

B 

1-2 

4.55 

39.39 

1.52 

54.55 

66 

B 

1-2-3 

9.80 

25.49 

27.45 

37.25 

51 

B 

1-2- 3- 4 

— 

B 

n 

7.3  1 

14.63 

0.00 

78.05 

4  1 

B 

n-  1 

24.  14 

20.69 

34.48 

20.69 

29 

B 

E(x) 

14.93 

23.34 

17.43 

44.29 

B 

<?(x) 

9.71 

8.  85 

14.05 

20.15 

63 


Table  D-I  (cont.) 


c 

1 

20.61 

20.33 

26.  18 

32.87 

3  59 

c 

1-2 

10.53 

26.32 

9.72 

53.44 

247 

c 

1-2-3 

1-2- 3-4 

17.01 

31.44 

19.07 

32.47 

1  94 

c 

c 

n 

3.53 

32.35 

7.06 

57.06 

170 

c 

n- 1 

9.22 

36. 17 

17.02 

37.59 

1  41 

c 

E(X) 

12.  18 

29.32 

15.81 

42.67 

c 

tf(x) 

6.01 

5.48 

6.83 

10.48 

D 

1 

25.31 

23.65 

19.92 

3T.  12 

241 

D 

1-2 

4.76 

16.67 

14.  17 

74.40 

1  68 

D 

n 

1-2-3 

1  -2-3-4 

19.27 

32.  1 1 

11.93 

36.70 

109 

D 

n 

15.  66 

27.71 

25.30 

31.33 

83 

D 

n-1 

13.04 

18.84 

18.84 

49.28 

69 

D 

E(x) 

15.61 

23.80 

18.03 

44.57 

D 

6.81 

5.65 

4.67 

16.31 

E 

1 

33.22 

14.29 

32.22 

18.27 

301 

E 

1-2 

7.36 

22.48 

21.32 

48.24 

258 

E 

1-2-3 

12.27 

25.00 

25.90 

36.82 

220 

E 

1-2- 3-4 

— 

E 

n 

10.84 

13.79 

7.88 

64.49 

2  03 

E 

n- 1 

13.  16 

28.95 

29.47 

28.42 

203 

E 

E(x) 

15.  37 

20.90 

23.36 

39.85 

E 

<r(x) 

9.14 

5.  97 

8.56 

16.97 
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Using  the  data  from  table  D-I  and  the  following 
information  from  table  6.1  of  Lipschutzf Ref .  17],  points 
were  calculated  for  the  magnitude  coordinates  at  0,  ±|  ,  ±2, 
and  ±3  standard  deviations,  for  each  category: 


0  =  0.4 

±1 

=  0 

.  242 

±2=0 

.054 

±3  = 

0. 0044 

tftxr 

xr 

tffxj* 

(AW~ 

Table  D- 

-tl. 

Fricative 

Magnitudes 

-3 

-2 

-1 

0 

1 

2 

3 

A  ,X 

-  16.91 

-6.69 

3.53 

13.75 

23.97 

34.  19 

44.41 

A,F(X) 

.0004 

.005 

.024 

.  039 

.024 

.005 

.0004 

A,  cr(x) 

10.22 

Q  ,  X 

-14.2 

-4.  49 

5.22 

14.93 

24.64 

34.35 

44.06 

B,f  (X) 

.  0005 

.  006 

.025 

.  041 

.025 

.006 

.0005 

B  ,  C'(X) 

9.71 

C,X 

-5.85 

0.  16 

6.  17 

12.18 

18.  19 

24.2 

30.21 

C,f  (X) 

.  0007 

.009 

.040 

.067 

.040 

.009 

.0007 

C,  tf(X) 

6.01 

D,X 

-4.82 

1.99 

8.8 

15.61 

22.42 

29.23 

36.04 

D,f  (X) 

.  0006 

.008 

.036 

.059 

.036 

.008 

.0006 

D,  tf(x) 

6.81 

E,X 

-12.05 

-2.  91 

6.23 

15.37 

24.51 

33.65 

42  .79 

Z,f  (X) 

.0005 

.  006 

.026 

.044 

.026 

.006 

.0005 

B  t  O^X) 

9.8 
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66 


Table  D-III.  Nasal  Magnitudes 


-3 

-2 

-1 

0 

1 

2 

3 

A,X 

-26.32 

-8.  11 

10.  1 

28.31 

46.  52 

64.73 

82.94 

A,f  (X) 

.  0002 

.  003 

.013 

.022 

.013 

.003 

.0002 

A  /  <T(X) 

18.21 

B,X 

-3.21 

5.64 

14.49 

23.34 

32.  19 

41.04 

49.89 

B  /  f  (X) 

.  0005 

.006 

.027 

.045 

.027 

.006 

.0005 

B  t  ef{x) 

8.85 

c,x 

12.88 

18.36 

23.84 

29.32 

34.  80 

40.28 

45.76 

c,f  (X) 

.  0008 

.01 

.044 

.072 

.044 

.01 

.0008 

C,  ^(X) 

5.48 

D  /  X 

6.85 

12.5 

18.  15 

23.80 

29.45 

35. 10 

40.75 

D,f  (x) 

.0007 

.010 

.043 

.071 

.043 

.010 

.0007 

D,  tf(x) 

5.65 

E ,  X 

2.99 

8.96 

14.93 

2  0.90 

26.87 

32.  84 

38.81 

E,f  (X) 

.  0008 

.  009 

.041 

.067 

.041 

.009 

.0008 

E,<T(X) 

5.97 

67 


68 


Table  D-IV.  Stop  Magnitudes 


-3 

-2 

-1 

0 

1 

2 

3 

A,X 

-25.44 

-12.32 

0.80 

13.92 

27.  04 

40. 16 

53.28 

A,f  (X) 

.  0003 

.004 

.018 

.031 

.018 

.004 

.0003 

13.  12 

B  ,  X 

-24.72 

-10.67 

3.38 

17.43 

31.48 

45.53 

59  .58 

B,f  (X) 

-  0003 

.004 

.017 

.028 

.017 

.004 

.0003 

c,  <r(x) 

14.05 

c,x 

-4.68 

2.  15 

8.98 

15.81 

22.64 

29.47 

36.30 

C,f  (x) 

.0006 

.008 

.035 

.059 

.035 

.008 

.0006 

C,  <T(x) 

6.83 

D  ,  X 

4.02 

8.69 

13.  36 

18.03 

22.70 

27.37 

32.04 

D,f  (x) 

.  0009 

.012 

.052 

.086 

.052 

.012 

.0009 

Dr  <T(X) 

4.67 

E ,  X 

-2.32 

6.24 

14.8 

23.36 

31.92 

40.  48 

49  .04 

Erf  (X) 

.  0005 

.  006 

.028 

.  047 

.028 

.006 

.0005 

E,  (fW 

8.56 

69 


70 


Table  D-V.  Vowel  Magnitudes 


-3 

-2 

-1 

0 

1 

2 

3 

A,X 

-47.72 

-  17.  14 

13.  44 

44.02 

74.60 

105.18 

135.76 

A,f  (X) 

.0001 

.002 

.008 

.013 

.00  8 

.002 

.0001 

A,  £T(X) 

30.58 

B  r  X 

-16.  16 

3.99 

24.  14 

44.29 

64.44 

84  .59 

104.  74 

B  r f  (X) 

.  0002 

.  003 

.012 

.020 

.012 

.003 

.0002 

B,  tf(x) 

20.15 

c,x 

11.23 

21.71 

32.  19 

42.67 

53.  15 

63.63 

74.11 

c,f  (X) 

.0004 

.005 

.023 

.038 

.023 

.005 

.0004 

C,  tf(X) 

10.48 

D  ,  X 

-4.36 

11.95 

28.  26 

44.57 

60.  88 

77.19 

93.5 

D,f  (x) 

.0002 

.  003 

.015 

.025 

.015 

.003 

.0002 

D,  tf(x) 

16.31 

E  ,  X 

-11.06 

5.91 

22.  88 

39.85 

56.82 

73.79 

90.76 

Erf  (X) 

.0003 

.003 

.014 

.024 

.014 

.003 

.0003 

E  r  J'(X) 

16.97 

/ 
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PERCENTAGE  OF  WORDS  LONGER  THAN  3-SOUNDS  WITH  BEGINING  4-SOUND  COMBINATIONS: 
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PERCENTAGE  OF  WORDS  LONGER  THAN  1-SOUND  WITH  ENDING  2-SOUND  COMBINATIONS: 
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PERCENTAGE  OF  WORDS  LONGER  THAN  2  SOUNDS  WITH  BEGINING  3-SOUND  COMBINATIONS: 
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PERCENTAGE  OF  WORDS  LONGER  THAN  3-SOUNDS  WITH  BEGINING  4-SOUND  COMBINATIONS 
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PERCENTAGE  OF  WORDS  WITH  LAST  SOUND  AS  NOTED. 

NUMBER  OF  WORDS  IN  SAMPLE  OF  LENGTH  5  OR  GREATER  IS: 
63.00000 
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PERCENTAGE  OF  WORDS  LONGER  THAN  1-SOUND  WITH  ENDING  2-SOUND  COMBINATIONS: 
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LANGUAGE 
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PERCENTAGE  OF  WORDS  WITH  FIRST  SOUND  AS  NOTED 


NUMBER  OF  WORDS  IN  SAMPLE  OF  LENGTH  1  OR  GREATER  IS: 
359.0000 
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PERCENTAGE  OF  WORDS  LONGER  THAN  ONE  SOUND  WITH  BEGINING  TWO-SOUND  COMBINATIONS 


sO 

o 

in 

vO 

f- 

SO 

O' 

CM 

CO 

> 

O' 

> 

> 

CO 

> 

CM 

H 

2 

CL 

CO 

r-^ 

> 

CM 

& 

CM 

CM 

• 

• 

• 

O 

• 

00 

<r 

CO 

• 

CO 

r-M 

.-4 

CM 

in 

CO 


or 

UJ 


< 

LU 

cc 

o 


cc 

o 

CO 

X 

K 

o 

2 


UJ 


LL 

o 

UJ 

-J 

CL 

2 

<1 

CO 

2 


o 

o 

o 

h- 

CO 

NO 

r- 

O' 

co 

CO 

r-4 

CO 

IS) 

CO 

CO 

• 

IT. 

LL 

2 

r» 

CL 

CO 

> 

NO 

sO 

o 

o 

s: 

o 

O' 

o 

r- 

CO 

• 

</) 

• 

• 

CO 

O' 

O  2 

2 

X 

-J 

o 

o 


II 

o  o  o  o  o  o 

LLcO 

CL  CL  CL  cl  CL  COC 

LL  2  CL  ^  >  O 

OJH 

ou 

<IUJ 

2  — 

UJ 

CJ 

or 

UJ 

CL  • 


2 

UJO 

CM 

r- 

o 

h- 

O' 

Ih 

CO 

GO 

o 

O 

CO 

hh 

h- 

2 

in 

2 

o 

2 

2 

CO 

2 

o 

H 

tn 

LL 

CO 

2 

vt 

CL 

CO 

CM 

> 

M- 

COCO 

CO 

o 

r- 

<o 

CO 

O 

CO 

• 

CL 

• 

• 

• 

• 

O 

sQ 

CM 

r- 

20 

CM 

=>2 

□G 

COO 

UJ 

Oco 


CO 

QO  O  O 

or  o 

OO  LL  LL  LL 

30  CL  Z  CL 

• 

ULf- 

U<t 

CM 

or 

UJ 

CO 

X 

x> 


O  UJ 

OX 

O 

UJK 

H 

CO 

00 

(O 

LL 

U. 

sQ 

2 

o 

CO 

> 

CM 

LL »— • 

CM 

in 

O 

in 

• 

UJ 

• 

O 

COO 

O 

f-H 

h2 

co< 

>or 

-JO 

<o 

20 

<o 


94 


O 

00 

0 

O  O 

O 

^-4 

O 

CO 

O' 

f  • 

> 

> 

s0 

> 

> 

> 

> 

> 

C0 

> 

> 

00 

LL 

2 

sT 

CL 

OO 

> 

LL 

2 

O 

CL 

l/> 

2 

LL 

UL 

in 

LL 

LL 

LL 

2 

2 

2 

2 

O 

in 

HH 

• 

• 

in 

*— 4 

< 

2 

HH 

CO 

X 

o 

o 


Q 

2 

O 

O 

O 

O 

CO 

00 

O  O 

O 

3 

on 

OO 

00 

m 

00 

r- 

a* 

00 

00 

O 

LL 

2 

CL 

00 

:> 

c\j 

LL 

2 

CL 

00 

00 

CO 

LL 

LL 

LL 

LL 

LL 

0 

• 

co 

2 

2 

2 

2 

o 


o 

II 


o 

< 

3 

o 

2 

< 


o 

UJ 

CO 


00 


cc 

LU 


X  < 
h-  UJ 
^  CL 

3  e> 

00  cc 
q  a 
2 

X  sf 

o 

00  X 

I— 

CNJ  O 


a 

LL 

LL 


CL 

a 

2 

Q_ 

LL 

LL 

Q 

00 

LL 


CL 

> 

LL 


O. 

LL 

2 


a 

cl 


CL 

</> 

2 


< 

_J 

0 

0 

0 

0 

O 

O  O 

O 

X 

OO 

LL 

2 

2 

2 

2 

2 

O 

2 

2 

2 

2 

O 

LL 

2 

CL 

00 

> 

*4" 

LL 

2 

CL 

00 

CC 

LL 

LL 

LL 

LL 

LL 

CO 

2 

2 

2 

2 

LU  LU 
O  -J 

2  a 
C  X 
-J  < 

00 

00 

a  2 

nc  i-i 

c 

3  ^ 


ao 

O 

0 

0 

O 

CO 

O 

O 

O 

LL 

aco 

CO 

O 

00 

LL 

LL 

U- 

LL 

LL 

r- 

LL 

IL 

LL 

LL 

30 

LL 

2 

CL 

00 

> 

cvj 

LL 

2 

CL 

00 

LL 

• 

LL 

LL 

LL 

LL 

LL 

2 

2 

2 

2 

O 

LL*4" 

0 

< 

OCT* 

• 

«-4 

CO 

2  CL 
UJ  UJ 
O  CO 
OL  X 

ll«  3 
cl  2 


95 


030927 


123711  3.608247  0  8.247416 


ooooooo^ooom 

^  *4" 


> 

> 

> 

> 

> 

> 

> 

> 

r- 

> 

> 

> 

> 

CO 

> 

LL 

Z 

CL 

CO 

> 

LL 

Z 

co 

Cl 

t/) 

> 

LL 

CO 

z 

Cl 

CL 

CL 

Cl 

a 

tO 

to 

C\l 

CO 

to 

00 

> 

r*- 

C\J 


t 


Ch 


o  o  o  o  o 


CO 

OO 

to 

CO 

CO 

00 

IL 

z 

cl 

to 

> 

LL 

a 

Cl 

CL 

CL 

CL 

CO 

o 

o 

O 

o 

r- 

»-H 

Ch 

LO 

to 

to 

to 

C\J 

OO 

m 

z 

Cl 

CO 

> 

CD 

LL 

vO 

to 

CO 

to 

to 

o 

o 

• 

(O 

> 

vT 

in 

• 

o 

o 

o 

o 

o 

o 

in 

CO 

o 

O 

00 

CNJ 

Ch 

z 

z 

z 

z 

z 

z 

z 

>o 

z 

z 

z 

O 

z 

ro 

LL 

z 

CL 

to 

> 

LL 

z 

a 

to 

> 

LL 

vO 

CL 

a 

a 

CL 

CL 

to 

CO 

in 

in 

• 

o 

to 

to 

00 

o 

r- 

• 

s0 

> 

in 

• 

fH 

o 

O 

o 

o 

o 

O 

o 

o  o 

GO 

04 

o 

LL 

IL 

IL 

U. 

LL 

LL 

LL 

LL 

U. 

U. 

O 

LL 

IL 

z 

Cl 

to 

> 

LL 

Z 

CL 

00 

> 

F^ 

LL 

a 

Cl 

CL 

CL 

a 

to 

00 

to 

to 

to 

o 

Is- 

• 

o 

> 

96 


CVJ 

o 

CVJ 

o 

CO 

O 

o 

> 

rH 

> 

> 

»— 4 

> 

>* 

2 

o 

a 

00 

o 

> 

Is- 

> 

r- 

> 

> 

r- 

> 

o 

nO 

• 

• 

• 

CVJ 

in 

in 

m 

i—4 

o 

o 

r- 

m 

O' 

CVJ 

i—4 

to 

cn 

oo 

oo 

oo 

O' 

CVJ 

2 

sO 

a. 

OO 

> 

o 

r- 

> 

> 

> 

> 

cn 

o 

in 

o 

•  • 

• 

• 

—  o 

#— 4  X  *— 4 

3 

CO 


ILO 

oo 


Q. 

a. 

> 


CL 

«/) 

> 


CL 

> 

> 


UJ  II 

o 

<OC 

KO 

21- 

LUU 

OUJ 

ar> 

LUw 

a. 


UJ 


X  • 

1-2 

o 

in 

r- 

O 

<*• 

r- 

in 

CVJ 

Ohm 

CVJ 

2 

»-i 

2 

2 

CO 

2 

cn 

2 

o 

Q. 

OO 

»— 4 

> 

o 

M 

> 

cn 

> 

> 

vO 

> 

cn 

00  00 

<r 

'O 

o 

o 

<a 

• 

• 

• 

• 

CL 

i—4 

CVJ 

o 

CO 

20 

3cd 

c*-« 

OOX 

I— 

o 


OC  UJ 
Hi 

a> 

0 

0 

in 

Xi- 

GO 

^-4 

cn 

1- 

CVJ 

UL 

m 

UL 

UL 

u_ 

'O 

2 

o 

2 

r- 

Cl 

00 

> 

-t 

LLm 

i—4 

> 

Is- 

> 

> 

> 

in 

O 

o 

m 

^4 

UJ 

• 

• 

in 

00  o 

Is- 

CVJ 

• 

*-•2 

o 

oo< 

>*a: 

-13 

<o 

20 

<o 


97 


PERCENTAGE  OF  WORDS  LONGER  THAN  3-SOUNDS  WITH  BEGINING  4-SOUND  COMBINATIONS 
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PERCENTAGE  OF  WORDS  WITH  LAST  SOUNO  AS  NOTED. 

NUMBER  OF  WORDS  IN  SAMPLE  OF  LENGTH  I  OR  GREATER  IS: 
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PERCENTAGE  OF  WORDS  LONGER  THAN  1-SOUND  WITH  ENDING  2-SOUND  COMBINATIONS 


> 

LL 


•  • 


r-M 

r- 

O 

o 

CM 

O' 

o 

CO 

CM 

> 

o 

> 

> 

CO 

> 

in 

z 

GO 

0u 

00 

CO 

> 

00 

CM 

r- 

ro 

O 

• 

CM 

o 

• 

in 

• 

<r 

CM 

i 

m 

• 

o 

o  r-  o 
o 

oo  oo  o  to  oo 

u-  z  vr  a  oo 

m 

GO 

CM 


O 


o> 

CO 

v)  in 
>  cm 
oo 

CO 


NO 


oo 


CC 

LU 

< 

LJJ 

CL 

o 

CC 

o 


m 

x 


o 

z 

LU 


LU 

-J 

CL 

2" 

<1 

00 


iS) 

< 


O  CM  o  O 

CO 

z  z  m  z  z 

ll  z  co  a  00 

o 

o 


o 

O' 

Z  CM 

> 

CM 

o 

CM 


O  • 

zz 

3C 

Ohm 

00K 

HM 

>—oo 

oo  o 

<tCL 


0< 

K  X 


oo 

ao 

CLO 

CO 

• 

LLP*- 

a<r 

CM 

CL 

LU 

CO 

x 

3 

Z 


xz 

0 

0 

o 

O 

P- 

LU  HM 

in 

z 

LL 

LL 

LL 

LL 

LL 

H 

UJ 

LL 

Z 

CL 

V) 

> 

m 

LL  O 

o 

az 

CM 

< 

• 

ooo c 

in 

*—»  3 

ooo 

>o 

-JO 

< 

Z  LL 
CO 


102 


263157  20.64775  0  9.716597  21.45746 


* 

* 

# 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

# 

* 

* 

* 

* 

* 

* 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

■* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


* 

•* 

2 

on 

* 

* 

z> 

2 

* 

* 

>1/7 

>  >LL 

* 

* 

LL  2  > 

* 

* 

00  2 

* 

* 

LL  > 

00  > 

* 

* 

>00  2 

>  200  LL 

•  • 

* 

* 

> 

>2  >>  OOLL  > 

00 

* 

* 

LL 

00 

OO 

00>  ZOO  LL  >  >LL 

t— 

* 

* 

OO 

> 

200 

>U0  OO  >  >2 

* 

* 

> 

>ll2 

>2 

OO  >  00  ^ 

CL 

* 

* 

> 

oo2>  > 

2>>2 

>  OOLL  OOLL  2 

LU 

* 

z 

* 

> 

>>LL  > 

2LL> 

> LL  LL  LL  >  >  >> 

• 

\— 

* 

0 

> 

2LL2>>>  00 

mZlS)  >1 /)  OO  2  LLLL 

O 

< 

* 

• 

* 

>  >>  2  LL  > 

00 

>  LL  2 

UJ 

LU 

* 

* 

LL2LL22  2 

on 

>222 

LL  >  00  > 

I- 

CL 

* 

II 

> 

>  >>00  > 

2 

00>>> 

2>0O  >>OOZOO>LL 

O 

O 

* 

OO 

>  2U->2  2 

> 

LL  2 

>2  >  LL  >  >  LL  OO 

2 

* 

UJ 

* 

> 

2  Z  >  2LL  >  > 

> 

> 

200200  2  LL  2 

QC 

* 

0 

* 

>> 

>  >>>>LL>> 

LL 

ol>  > 

00>  LL  LL  > 

V) 

a 

* 

< 

* 

2  LL 

Z  2  >  2  LL  00  LL 

> 

2oO>2 

>00Z>  LL  22 

< 

* 

3 

* 

>> 

>>Z  >  > 

> 

>  00 

2  >  >  LL  >  2  00 

rH 

* 

0 

* 

2>>>>Z  XSlU-Z 

LL 

>  LL  22  >  U- 

O 

2 

* 

>LLLL 

ll>>2oo>>>ll 

00  00 

>  >  LL  OO  >  >  LL 

2 

X 

* 

<x 

* 

2  > 

>LL  >  >  2  >00  OO 

LLZLL> 

LL2LLZ  LLLL  LL  > 

Z) 

h- 

* 

* 

2  > > 2  >  >  2>  2 

2 

>  >  >  2 

>>  >U.>>>  LL 

O 

0 

* 

* 

>WZ>W>>Z22»> 

> 

OOLL  LL 

2  LL  OO  LL  2  LL  > 

00 

2 

* 

* 

LL  2ZIL  l/)>  >1/)  LOLL 

2 

LL  OO >  2  OO  >00Z 

UJ 

* 

# 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

•* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

# 

* 

* 

* 


00 


>> 


2  >  > 

>ZZ  oo> 

LLLL  >»2Z>Z>  200 

Z>  >oo</)  >2>Z  oo> 

>llooll>>  oo>oo>oo>Zoo 

LLLL>>LLOO>  >22ZlL^ 

Z>ZZ  >>>  ZZ  >>> 

>>>Z  00>Z00>>  LL>Z 

LL>>  C/}>>>  U0> 

>  >  ZZW  >z 
ZZZ  ooo0>  >  zz>>>00 

>00  2>LLZ  >Z>Z  ooll> 


OO  LL  >Z>  00> 

>>002  2  OOOOZlL  00  00 

ZZ>  LL  LL  00>>LL  2 

>>  O^LL  LL  LL>2  > 

OO  LLLL  ZZ»  2:  >LL  LL 

>2  >>  LL  >  O0> 

LL>  LL  Z  OOZLL  > 

LL  OO  >  OOLL  LLOOOOOOOO  > 

>  2  IL>0000  >>  OOLL 

>  >  >  OO  > 

OOLL  LL  2  00  200 

>O0>  LLZ  >2  Z>  00 

LL  > >  >>  LL>>  LL  > 

22  LOZZLL  LLLL 
>>  Z  2 

oooooo  >>> 

LLLL  OO 

LL2 


OO 

CC  LL 
►-  O 

LL 

LU 


2>  >>>  22>LL>  >2002  00 

:oozz>  >  Z  LL  >oo>2>  > 

Z>>  >>  >  >  LL  >  > 

200>2  OO 002200 LL2>>OOLL  LL  2  >oo2> 
>>>>LL>>>>>>>ZLL>>>  2>002>  _  _  _ 

2>2>>00>200>>2>>00>  >  LL  >  >>>oooo  22 

OO  LL  2  >  LL  2  O0LLO02  22O0> 

2  >  >  LL  >  LLOO  >> 

2  >  LL  LL  OOLL  >>  >  >LL 

>^>O0LL>>LL>>O0C0>22>O0  >  >  >  LL  2Z^>  2  >> 

>2>2>>  >  >  2  >  0000  >  LL  >  >22  >  >  >  LL  >  LLOO 

222 >2 2  22>oo>2222- 

2  >  >  >  >  >  LL2>>>>00; 

200LLOO  LL  2  LL  >>002LLLL>  00  002  00  > 

>  2  OO  LL  2200  >LL>>LLLLLO  LL2  LL  >>  2LLOO>>LO 

>OOLL>  >LL002>>>  >LL  00  0O>LL  200 OOLL  LL>LL>  LLLL  OOOO  >LL 


OO 

Cj 

CC 

o 

2 

LL 

O 

LU 

O 

< 

h- 

2 

LU 

O 

oc 

UJ 

CL 


CM 

cn 

o 

CM 


m 


LO  o 

r- 

O' 

• 

O' 


V* 

H 

in 

sO 

• 

m 

CM 


< 

00 


00 

00 

oco 

OO 

:so 

• 

LL  i—i 

o<t 

CM 

CC 

LU 

CD 

2 

Z> 


O' 


CO 

in 

CM 


103 


O) 

40 

4- 

0 

sO 

4) 

H 

2 

m 

O' 

h- 

r- 

0 

> 

(NJ 

> 

0 

> 

> 

>o> 

*4“ 

4“ 

»— • 

LL 

in 

2 

ao 

a 

on 

sO> 

O 

O 

h- 

4* 

m 

0 

O' 

4" 

< 

• 

• 

• 

• 

• 

2 

ao 

h- 

0 

r— 1 

4" 

HH 

CM 

r*- 

CD 

2 

O 

o 


o 
2 : 
o 


0 

CM 

O 

0 

O 

4" 

40 

CO 

in 

*—< 

0 

1 

on 

O 

O) 

CO 

CO 

l/> 

h- 

• 

40 

0 

LL 

O 

2 

CL 

on 

> 

in 

— • 

vO 

3 

ao 

ao 

2 

sO 

1— 

m 

h- 

ro 

<— 1 

• 

• 

on 

• 

0 

CM 

4* 

2 

2 

*—• 

2 

2 

•  • 

D 

H- 

lO 

— I 

0 

»— « 

a 

LU 

0 

GO 

QL 

LU 

it 

X 

f— 

O 

O 

0 

O 

O 

O 

z 

K- 

< 

LLoO 

O 

•-H 

LU 

Cl 

a 

CL 

CL 

CL 

C<* 

* 

2 

a: 

LL 

2 

CL 

CO 

> 

O 

0 

LUj— . 

li 

O 

OO 

2 

cc 

<t  LU 

uu 

ro 

0 

H> 

0 

0 

2^ 

< 

CO 

m 

LU 

0 

O 

0 

LU 

X 

QC 

2 

2 

h- 

LU 

c 

O 

O 

CL  • 

_J 

2 

2 

2 

LU 

LUO 

C 

-J 

4" 

O 

0 

4“ 

4- 

X~ 

vO 

X 

r—4 

O 

O 

l-H 

vO 

1- 

LL 

2 

h- 

2 

2 

2 

O'  2 

H-4 

40 

O 

LL 

in 

2 

a 

on 

onto 

43 

a: 

ao 

vO 

<0 

0 

LU 

LU 

r- 

h- 

• 

CL 

• 

0 

-J 

• 

• 

0 

O 

40 

2 

CL 

«— 1 

4- 

20 

r— » 

O 

2 

02 

—1 

< 

00 

CO 

000 

on 

UJ 

0 

2 

Ooo 

oc 

1— • 

2 

O 

O  LU 

3 

on 

OX 

Qo 

0 

O 

0 

4- 

0 

UJH 

4* 

LL 

0CO 

H 

o> 

00 

O 

O 

00 

LL 

LL 

LL 

LL 

r—  LL 

2 

O' 

30 

LL 

2 

CL 

on 

in> 

vO 

LLhh 

«— 1 

LL 

• 

CO 

r- 

0 

40 

O 

LL  GO 

r- 

O' 

LU 

r- 

C 

OsO 

• 

• 

COO 

• 

K 

r-4 

r*H 

CM 

mZ 

4- 

2 

CL 

on< 

LU 

LU 

>a_ 

O 

CD 

00 

a: 

2 

CO 

LU 

ro 

20 

Cl 

2 

CO 

104 


PERCENTAGE  OF  WORDS  LONGER  THAN  2  SOUNDS  WITH  BEGINING  3-SOUND  COMBINATIONS: 
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PERCENTAGE  OF  WORDS  LONGER  THAN  3-SOUNDS  WITH  BEGINING  4-SOUND  COMBINATIONS: 
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PERCENTAGE  OF  WORDS  LONGER  THAN  l-SOUND  WITH  ENDING  2-SOUND  COMBINATIONS 
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PERCENTAGE  OF  WORDS  LONGER  THAN  2  SOUNDS  WITH  BEGINING  3-SOUND  COMBINATIONS 
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PERCENTAGE  OF  WORDS  LONGER  THAN  3-SOUNDS  WITH  BEGINING  4-SOUND  COMBINATIONS: 
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PERCENTAGE  OF  WORDS  WITH  LAST  SOUND  AS  NOTED 
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PERCENTAGE  OF  WORDS  LONGER  THAN  1-SOUND  WITH  ENDING  2-SOUND  COMBINATIONS 
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PROBABILITY  MATRIX  FOR  FIRST  SOUNDS: 
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VALIDITY  FACTOR  PRODUCTS 
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COMPUTER  PROGRAM  NUMBER  1 

***  *****  **  *  *******  *********  4##  ***************  ***##*  ***  *#«*** 


//COOK  1  JOB  (C405t 1404, MA52 ) ,’SMC  2737  * ,TI ME= ( 00 , 40 ) 

//  EXEC  ALGOL W 

//SYSPRINT  DD  SYSOUT=A , DCB=BLKS I ZE=665 , SPACE= ( CYL , ( 3, 1 ) ) 
//ALGOL. SYSIN  DD  * 

SALGOL  T=035,P=80 

BEGIN 

COMMENT  INPUT  DATA  WRITTEN  IN  "ROMI”  WITH  AT  LEAST  ONE  BLANK 
BETWEEN  WORDS.  WORD  MUST  START  AND  END  ON  THE  SAME  CARD.  ; 
STRING  (2)  ARRAY  ROMI  (  1: :  101» 1 : : 2) ; 

STR I NG ( 80 )  TEXT; 

INTEGER  INDEX  t POI NTER , Z , TP . WP ,N ; 

REAL  TWOCOUNT ,THREECOUNT» FOURCOUNT, Q 5 
ST  RI NG ( 36 )  WOR  C»  CODE ; 

STRING  (1)  ARRAY  CLASS(l::5)  ; 

REAL  ARRAY  S DUN  DON E ( 1 : : 5 ) ; 

STRING  (2)  ARRAY  SOUNDTWO ( 1 : : 25 ) ; 

REAL  ARRAY  SOUNDTWOS TAT ( 1 :  : 25 )  ; 

REAL  ARRAY  SOUN DTHREESTAT ( 1 : : 125 ) ; 

STRING (3 )  ARRAY  SOUNDT HR EE ( 1 : : 1 2 5 ) ; 

REAL  ARRAY  SOUNDFOURST AT  ( 1  :  c  62 5 ) ; 

STR I NG ( 4 )  ARRAY  SOUNDEOUR ( 1 : : 62 5 ) ; 

REAL  ARRAY  VECTWO  (1  :  :  5 ) ; 

REAL  ARRAY  VECTHREE  <1::5); 

INTEGER  EPOINT; 

INTEGER  HEADERCOUNT; 

ST  RING ( 80 )  ARRAY  HEADER  <  1 : : 10 ) ; 

REAL  ARRAY  ENDT WOST AT  <1::25); 

REAL  ARRAY  ENDONE  (1::5>; 

REAL  ARRAY  ENDVEC2  (  1 :  :  5  )  ; 

LOGICAL  FOUND  ?  BAD; 

REAL  ARRAY  STATMUX1  <1::5,1::5); 

REAL  ARRAY  PMUX1  ( 1 : : 5 1 1 : : 5  )  ; 

REAL  ARRAY  VECFOUR  < 1 :  : 5  )  ; 

REAL  B; 

REAL  ECOUNTt  E.ICOUNT; 
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COMMENT  **  ***********  *****  ************  ******  ***  ******  ****** 
PROCEDURE  INITIALIZE  RESETS  THE  VALUES  OF  THE  GLOBAL 
COUNTERS  USED  THROUGHOUT  THE  PROGRAM.  INITIALIZE  ALSO  READS 
AND  PRINTS  THE  DATA  SET  IDENTIFIER  "HEADER » "  SETTING 
"TEXT  POINTER,"  "TP,"  TO  80  CAUSES  THE  PROCEDURE  GETWORD 
TO  READ  THE  FIRST  DATA  CARD. 

******  *  ***  ***  **  ***  ****  *  *  *  ********  a*************  ******  *******  . 

C 0 M M c N T * ******* * ** ****** * *** * 

HEADERCOUNT  IS  THE  NUMBER  OF  CARDS  PRECEEDING  THE  DATA 
WHICH  DESCRIBE  THE  TYP E/ QU AL ITY/ ETC .  OF  THE  DATA  BUT  ARE 
NOT  TO  BE  REAC  AS  PART  OF  THE  DATA. 
*****************************; 


PROCEDURE  INITIALIZE: 
BEGIN 
G  :=  0; 

TWOCOUNT  :=  0; 
THPEECOUNT  :=0; 
FOURCOUNT  :=  0; 

E  ICOUNT  :=  0: 

ECGUNT  :=  0; 


INDEX: =0 ; 

READ  (HEADERCOUNT)  ; 

FOR  I  :=  I  UNTIL  HEADERCOUNT  DO 
BEGIN 

HEADER  (I)  :=  " 

READCAP  C  ( HEA  DER ( I  )  ); 

WRITE  (HEADER ( I)  ) ; 

END; 

10 CO NT  POL  (2  )  ; 

WRITEC'  "); 

IOCONT  POL  (2); 

TP  :=  80 

END  INITIALIZE; 
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COMMENT  *  **#*#*£*  *£*#**  *:**£*£:£* £**#:*  #**£*$#£*  *$***##*:**#*:** 
PROCEDURE  GENERATETWO  PROVIDES  THE  25  TWO-SOUND  CLASS  LABEL 
DESIGNATIONS  IN  ALPHABETICAL  ORDER.  THESE  LABELS  ARE 
STORED  IN  THE  ONE  DIMENSION  ARRAY  "SOUNDTWO."  THE  LABELS 
CORRESPOND  TO  THE  25  ROWS  OF  THE  REAL  ARRAY  "SOUNDTWOSTAT." 

£  sjc:*#**;)!*:*  #£*£#**:***#*  :)c  sjc  *  fc**#  :jc  *:*:{:  sjc  ##  j*:  jje  ***  $  *$*:(£  *  ; 

PROCEDURE  GENERATETWO; 

BEGIN 
INTEGER  Y; 

Y  !  -  1  * 

FOP  I  :  =  1  UNTIL  5  DO 
BEGIN 

SOUNCONE ( I )  :=  0; 

FOR  J  : =  1  UNTIL  5  DO 
BEGIN 

SOUNDTWO  (Y)(Ol)  :=  CLASS  (I); 

SOUNDTWC  ( Y  )  (  I  1 1 )  :=  CLASS  (J)  *, 

SOUNDTWOSTAT(Y)  :=  0; 

Y  :=  Y  +  I 
END; 

END; 

END  GENERATETWO; 


COMMENT  $*******$************************************:*:**:*** 
PROCEDURE  GENERATETHREE  TAKES  THE  25  ELEMENT  ARRAY 
'•SOUNDTWO"  AND  GENERATES  THE  125  POSSIBLE  COMBINATIONS  OF 
THE  FIRST  THREE  SOUNDS  OF  A  WORD  IN  ALPHABETICAL  ORDER. 

THE  125  ELEMENTS  ARE  STORED  IN  THE  ONE  DIMENSIONAL  ARRAY 
"SOUNDTHREE"  AND  ARE  USED  AS  LA3ELS  FOR  THE  125  ROWS  OF  THE 
REAL  ARRAY  "SOUNDTHREE  STAT.  .. 

**  *  *  *  *  ****  *********  ****  *  *  *  *  *****  *  *****  *  ********  *  *  ********  *  *  ; 

PROCEDURE  GENERATETHREE; 

BEGIN 
INTEGER  Y; 

Y  :  -  1  * 

FOR  I  :=  1  UNTIL  25  DO 
BEGIN 

FOR  J  :=  1  UNTIL  5  DO 
BEGIN 

SOUNDTHREE!  YMO  12  )  :=  SOUNDTWO!  I  )  ; 

SOUNOTHREE(Y) (2|1)  :=CLASS(J); 

SOUNCTHREESTAT! Y  )  :=  0; 

Y  :=  Y  +  l 
END; 

END; 

END  GENERATETHREE; 


PROCEDURE  GENERATEFOUR; 

BEGIN 

INTEGER  Y; 

Y  •  -  1  * 

FOR  I  :=  1  UNTIL  125  DO 
BEGIN 

FOR  J  :=  1  UNTIL  5  DO 
BEGIN 

SOUNDFOUR(Y) (013)  : =SOUNDTHREE ( I )  ; 
SOUNDFOUR(Y) (3  I  1 )  :=CLASS(J); 
SOUNDFOUPSTAT! Y)  :=  0; 

Y  :=  Y+l 
END; 

END; 

END  GENERATEFOUR; 
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COMMENT  #£#£:*****  #$**#*#***:***  **5f-***##*j!tA^j|tie!{t  ###:**:{:*$:**** 

PROCEDURE  GETARRAY  READS  IN  THE  101  CHARACTERS  USED  BY  THIS 
PROGRAM  TO  REPRESENT  THE  INTERNATIONAL  PHON=TIC  ALPHAS  ET . 
EACH  SOUND  REPRESENTATION  IS  PLACED  IN  A  TWO  COLUMN  ARRAY 
"ROMI . "  COLUMN  ONE  IS  THE  SOUND  REPRESENTATION  AND  COLUMN 
TWO  CONTAINS  THE  CLASS:  EITHER  NASAL,  FRICATIVE,  STOP,  OR 
VOWEL.  THE  NUMBER  OF  CARDS  USED  TO  READ  IN  "ROMI"  CANNOT 
BE  VARIED  WITHOUT  ALTERING  THE  PROGRAM.  THE  NUMBER  OF 
ELEMENTS  IN  "ROMI"  IS  NOT  VARIABLE.  "ROMI"  IS  PRINTED  OUT 
A  CARD  AT  A  TIME,  JUST  AS  IT  IS  TYPED  IN  THE  DATA  CARDS. 

***  S  **  :*  ##***£$*****  I***##:!!**  # $  *  £  :Jc $  ###***  **  jjc*  jjt  *  #  ; 

PROCEDURE  GETARRAY; 

BEGIN 

STRING ( 81 )  PHONOL; 

INTEGER  B, A ; 

B:  =  l; 

A  :  =  1 ; 

WRITE  (  "'ROMI*  IS  THE  PHONETIC  CHARACTER  RF°R 5S ENTATI ON 
USED  IN  THF  IMPLEMENTATION  OF  THIS  PROGRAM"); 

WRITE  ("  ">  ,* 

IOCONTROL  (2) ; 

WHILE  (A  <=  27)  DO 
BEGIN 

PHONOL  :=  " 

READCARD  (PHONOL); 

WRITE( PHONOL) ; 

TP : =0; 

WH I L E ( TP  <=  79)00 
BEGIN 

WHILE  (PHONOL  (TP|1)  ="  »)  AND  (TP<=79)  DO 
TP : =  TP+1 ; 

IF  (  TP  ->=  80  )  THEN 
BEGIN 

IF  (PHONOL  (TP  +  1  |  1  )-,="-«  )  THEN 
BEGIN 

ROMI  ( B  ,  1 ) :  =  PHONOL  <  T P  |  1 )  ; 

TP:  =  TP+2; 

END 

ELSE 

BEGIN 

ROMI  ( B, 1 ) ( 0 | 1 ) :  =  PHONOL  (TP|1); 

ROMI  ( B , 1 ) ( 1 j 1 ) :  =  PHONOL  (TP+211); 

TP: =TP+4; 

END ; 

ROMI  ( B , 2 ) :  =  PHONOL  ( TP  |  1 )  ; 

B:  =  B+l; 

TP  :=  TP+1 

END 

ELSE 

A  :=  A  + 1 ; 

END 

END; 

TP  :  =  80 
END  C-ETARPAY; 
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COMMcNT  ***** *************  ******  ****  *********  ************** 
LOGICAL  PROCEDURE  GETWORD  READS  THE  INPUT  DATA  WRITTEN  ON 
"ROMI"  INTO  A  BUFFER  CALLED  "TEXT,”  ONE  CARD  AT  A  TIME  AND 
ECHO  PRINTS  EACH  CARD  WITH  A  SEQUENTIAL  NUMBER  IDENTIFIER 
"INDEX."  THE  BUFFER,  "TEXT,"  IS  SCANNED  ONE  CHARACTER  AT 
A  TIME  UNTIL  A  "WORD"  IS  BUILT.  LEADING  AND  TRAILING 
BLANKS  DELIMIT  A  "WORD."  "WORD"  IS  THE  GLOBAL  PARAMETER 
OF  STRING  TYPE  OF  MAXIMUM  LENGTH  OF  36  CHARACTERS  WHICH 
IS  DELIVERED  TO  THE  MAIN  PROGRAM  AND  SUBSEQUENT  PROCEDURES. 
WHEN  THE  CHARACTER  » %"  IS  ENCOUNTERED  ON  A  DATA  CARD  IN 
OTHER  THAN  CARD  COLUMN  1  THE  LOGICAL  VALUE  OF  "GETWORD"  IS 
SET  TO  FALSE  THUS  SIGNALLING  THE  END  OF  A  DATA  SET. 

"WORDS"  MUST  START  AND  END  ON  THE  SAME  CARD. 

*******  ********  **** **  ******  **********  ********  *  **  ******  ***  **  ; 

LOGICAL  PROCECURE  GETWORD  <STRING(36)  RESULT  WORD): 

BEGIN 
W  P  *  —  Q  • 

I F( T P=80 )  THEN 
BEGIN 

INDEX: = INDEX+ 1 ; 

R.EADCARD  (TEXT)  ; 

WRITE( INDEX, "  ",  TEXT  )  ; 

TP :  =0 
END; 

WHI LE (TEXT ( TP |1  >="  ")  DO 

BEGIN 

TP: =TP  +  1 ; 

IF  (TP=80)  THEN 
BEGIN 

READCARD  (TEXT); 

INDEX: =INDEX+1 ; 

WRITE  (INDEX,"  ",  TEXT); 

TP  :  =0 

END 

END; 

WORD :=  "  "; 

WHILE  <TP<80)  AND  (TEXT(TP|  !)-<="  ")  DO 
BEGIN 

WORD  (WP|1) :=  T  EXT ( TP  I  1  )  ; 

WP  :=  WP  +  1; 

TP  :=  TP  +  I 
END; 

WP  :=  WP-l; 

WORD->="  ?" 

END  GETWORD; 
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COM  HEN  7  ***#*£*£#*#*£*#£*:***:£#**  3*  *2*  £***#*#*  *  *  *  *  £ 

PROCEDURE  "CONVERT"  OPERATES  ON  THE  GLOBAL  VARIABLE  "WORD" 
OBTAINED  FROM  "GET  WORD,"  "WORD"  IS  SCANNED  CHARACTER  BY 
CHARACTER  TO  OBTAIN  EACH  "ROMI"  SOUND  REPRESENTATION. 

A  BINARY  SEARCH  OF  THE  ARRAY  "RGM I "  IS  CONDUCTED  Tn 
08TAIN  THE  SOUND  CLASS  EQUIVALENT  FOR  EACH  "SOUND"  OF 
"WORD."  A  NEW  STRING,  "CODE,"  IS  GENERATED  FROM  THE  FOUR 
SOUND  DESIGNATIONS  CORRESPONDING  TO  THE  INTERNATIONAL 
PHONETIC  ALPHABET  SPELLING  OF  "WORD."  "CONVERT"  IS  ALSO 
CAPABLE  OF  EDITING  INPUT  DATA  AND  PRINTS  OUT  A  WARNING 
LABEL  ALONG  WITH  THE  PARTICULAR  "SOUND"  REPRESENTATION 
WHICH  DID  NOT  APPEAR  IN  "ROMI"  AND  THE  "WORD"  IN  WHICH  THc 
ERROR  OCCURRED.  A  SAMPLE  OUTPUT  FROM  "CONVERT"  WOULD  BE 
"FVNSFVNN."  THE  COUNTER  " Z"  IS  ALSO  A  GLOBAL  VARIABLE, 

AND  TELLS  THE  NUMBER  OF  IPA  SOUNDS  IN  EACH  STRING  "CODE." 
**********************  ***********  **************************  ; 

PROCEDURE  CCNVERT  ( STRING ( 36 )  RESULT  CODE); 

BEGIN 

STRING (2)  SOUND; 

INTEGER  I, TEST; 

INTEGER  LISTEND,  LISTBEGIN,  POINTER; 

CODE  :  =  "  "; 

BAD  :=  TRUE; 

I  :  =  0  ; 

Z  :  =  0; 

WHILE  i<=WP  DO 
BEGIN 

TEST  :=  0; 

FOUND  :  =  TRUE ; 

SOUND: ="  "; 

IF  WORD  Iltlll)-.*  THEN 

BEGIN 

SOUND  :  =  WORD  (III); 

I := 1+1 ;  END 

ELSE 

BEGIN 

SOUNO( 011):=  WORD( III); 

SOUND( 1 | 1) : =  WORD( I  +  2|  1  )  ; 

I :=I+3 
END; 

LI STBEGIN: =  1; 

LISTEND  :=  101; 

POINTER  :=  LISTEND; 

WHILE  (  FOUNC  )  DO 
BEGIN 

TEST  ; =  TCST  + 1  * 

IF  SOUND  <  ROMf ( POINTER, 1 )  THEN 
L I STEND : =  POINTER; 

IF  SGUND  >  ROMI ( POINTER, 1)  THEN 
LI STBEGIN: =  POINTER; 

IF  (  SOUND  =  ROMI  (  POINTER, 1  ) ) THEN 
FOUND  :=  FALSE 
E  LS  E 
BEGIN 

IF  (TEST  =  12  )  THEN 
BEGIN 

WRITE!" ERONEOUS  DATA", "WORD  =  ",WORD,  "SOUND  =  ",  SOUND); 
FOUND  :=  FALSE; 

I  :=  WP+l; 

BAD  :=  FALSE 
END; 

POINTER  :=  TRUNCATE  (  (  LISTEND  +  LISTBEGIN  )  /  2  ) 

END*, 

END; 

CODE  (  Zll  )  :=  ROMI  (  POINTER,  2  )  (Oil  ); 

Z  :=  Z+l  ; 

IF  (BAD  =  FALSE)  THEN 
Z  :=  0; 

END 

END  CCNVERT; 
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COMMENT  *************  *  *  ***  ****  **  *  *********************  ***** 
PROCEDURE  "ONESOUND"  LOOKS  AT  THE  FIRST  SOUND  OF  A  STRING 
OF  "CODE"  AND  KEEPS  A  RUNNING  TALLY  OF  THE  TOTAL  NUMBER 
OF  OCCURANCES  OF  EACH  SOUND  CLASS  DESIGNATION  IN  POSITION 
ONE  OF  EACH  STRING  OF  "CODE."  A  COUNTER  "Q"  TALLIES  THE 
TOTAL  NUMBER  OF  WORDS  IN  EACH  DATA  SET  (SAMPLE).  THE  DATA 

OBTAINED  IS  STORED  IN  THE  REAL  ARRAY  "SOUNDONE."  A  31  NARY 

SEARCH  IS  USED  TO  MATCH  THE  FIRST  SOUND  OF  A  WORD  WITH  THE 
FIVE  POSSIBILITIES.  THE  LIST  "POINTER"  USED  IN  THE  BINARY 
SEARCH  IS  A  GLOBAL  VARIABLE  AND  IS  USED  AS  A  STARTING  POINT 
FOR  THE  SEARCH  OF  THE  LIST  OF  ALL  POSSIBLE  LEADING  TWO 
SOUND  COMBINATIONS. 

************* ** ***** ************************* ************** ; 

PROCEDURE  ONESOUND; 

BEGIN 

INTEGER  LISTBEGIN,  L I  ST  END; 

Z  :  =  2.  —  I  * 

I F ' ( Z  >=  0)  AND  (BAD  =  TRUE)  THEN 
BEGIN 

Q  ;=  Q  +  l; 

LISTBEGIN  :=  1; 

LISTEND  :=  5; 

POINTER  :=  LISTEND; 

WHILE  ( CODE (Oil)  CLASS  (POINTER))  00 
BEGIN 

IF  (CODE(OII)  <  CLASS  (POINTER))  THEN 
LISTEND  :=  POINTER; 

IF  (CODE  (Oil)  >  CLASS  (POINTEP))  THEN 
LISTBEGIN  :=  POINTER; 

POINTER  :=  (LISTBEGIN  ♦  LISTEND)  DIV  2 
END; 

SOUNDONE  (POINTER)  :=  SOUNDONE  (POINTER)  +  1 
END 

END  ONESOUND; 
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COMMENT  **####*#£***$  *  ****  *###**  ##*:*#^  #  a*##  *  **  ^c^c  #  #  #  *  £  £  #*  ## 

PROCEDURE  "TWOSOUMD"  DOES  AN  ANALOGOUS  OPERATION  ON  THE 
FIRST  TWO  SOUNDS  OF  A  WORD  AS  "ONESOUNC"  PERFORMED.  BY 
REMEMBERING  THE  VALUE  OF  THE  LIST  "POINTER"  cRON  "CNESOUND" 
WE  NEED  ONLY  SEARCH  THE  APPROPRIATE  PORTION  OF  th  =  ARRAY 
"SOUNDTWO"  (FIVE  ELEMENTS)  FOR  THE  SECOND  SOUND.  THE 
NUMBER  OF  OCCURANCES  OF  EACH  COMBINATION  IS  STORED  IN  THE 
REAL  ARRAY  »S OUNDT WOS T AT . "  "TWO  COUNT"  TALLIES  THE  NUMBER 

OF  WORDS  FROM  THE  DATA  SET  WITH  TWO  OR  MORE  SOUNDS. 

#$**:*:{££**  £  *  *  *  *  £  :$e  *  *  **  *  *  *3*  *  **  £  **  *  A*  *  ; 

PROCEDURE  TWOSOUND; 

BEGIN 

INTEGER  LISTBEGIN,  LISTEND; 

IF  (Z>=  2)  THEN 
BEGIN 

TWOCOUNT  :=  TWOCOUNT  +  1; 

LISTBEGIN  :=  P0!NTER*5-4  ; 

LISTEND  :=  LISTBEGIN  +  4; 

POINTER  :=  LISTEND; 

WHILE  ( CODE ( 0  |  2  )  -**  SOUNDTWO(  POINTER  )  )  DO 
BEGIN 

IF  (CODE  (012)  <  SOUNDTWO  (POINTER))  THEN 
LISTEND  :=  POINTER; 

IF  ( COO  E 10  I  2 )  >  SOUNDTWO  (POINTER))  THEN 
LISTBEGIN  :=  POINTER; 

POINTER  :=  (LISTBEGIN  +  LISTEND)  DIV  2 
END  ; 

SOUNDT WOST  AT  (POINTER)  :=  SOUNDTWOSTAT  (POINTER)  +  1 
END; 

END  TWOSOUND; 
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COMMENT  *##****$*£*££5} c#*** 

THE  OPERATION  OF  PROCEDURES  THREESGUND  AND  FOUR- 
SOUND  IS  IDENTICAL  TO  PROCEDURE  TWOSOUND. 

REMEMBERING  THE  VALUE  OF  THE  LIST  POI NT  FR  REQUIRES  A  SEARCH 
OF  ANLY  FIVE  ELEMENTS  FOR  EACH  SUBSEQUENT  SOUND  OF  A  STRING 
"CODE."  THE  CORRESPONDING  ARRAYS  ARE  "SOUND THREES TAT” 

AND  "SOUNDFOURSTAT."  COUNTERS  FOR  EACH  CATEGORY  OF 
LENGTH  ARE  “THP EECOUNT"  AND  "FOURCOUNT." 
*##*******#$**#*£*****$$**£ 

PROCEDURE  THREESOUND; 

BEGIN 

INTEGER  LISTBEGIN,  LISTEND; 

IF  (  Z>=  3  )  THEN 
BEGIN 

THREECOUNT  :=  THREESOUMT  +  1  ; 

LISTBEGIN  :=  POINTER*5-4; 

LISTEND  :=  LISTBEGIN  +  4  ; 

POINTER  :=  LISTEND; 

WHILE  ( CODE ( 0 | 3 )  -.=  SOUNDTHREE ( PQ INTER ) )  DO 
BEGIN 

IF  ( CO DE (0 1  3 )  <  SOUNDTHREE(PQINTER) )  THEN 
LISTEND  :=  POINTER; 

IF  (  CODE ( 0  I  3 )  >  SOUNDTHREE ( POINTER) )  THEN 
LISTBEGIN  :=  POINTER; 

POINTER  :=  ( LI STBEG !N+L I STEND ) DI V  2 
END ; 

SO UN CT HR EES T AT ( POI NT 5°  )  :=  S OUNDT HR EEST AT ( POI NTER  )  +  1 

END; 

END  THREESOUND? 


PROCEDURE  FOURSOUND; 

BEGIN 

INTEGER  LISTENDtLI STBEGIN; 

IF  (  l  >=  4)  THcN 
BEGIN 

FOURCOUNT  :=  FOURCOUNT  +  1; 

LISTBEGIN  :=  POINTER  *5-4; 

LISTEND  :=  L I ST  BEG  I N  +  4; 

POINTER  :=  LISTEND; 

WHILE  (CODE  (0(4)  -=  SOUNDFOUR  (POINTER))  DO 
BEGIN 

IF  (CODE  (014)  <  SOUNDFOUR  (POINTER  ))  THEN 
LISTEND  .*=  POINTER; 

IF  (CODE  (014)  >  SOUNDFOUR  (POINTER))  THEN 
LISTBEGIN  :=  POINTER; 

POINTER  :=  (LISTBEGIN  +  LISTEND)  DIV  2 
END  ; 

SOUNDFOURST AT  (POINTER)  :  =  SQUNDFOURSTAT  (POINTER)  +  1 
END; 

END  FOURSOUND; 
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PROCEDURE  ENOS OUNDQNE ; 

BEGIN 

INTEGER  LISTENO, LISTBEGIN; 

IF  ( Z>  =  4 )  AND  ( BAD=  TRU  5 )  THEN 
BEGIN 

ECOUNT  :=  ECOUNT  +  I; 

LISTBEGIN  :=  1; 

LISTENO  :=  EPOINT  :=  5  ; 

WHILE  ( COOE( Z| 1>-=CLASS( EPOINT) )DO 
BEGIN 

IF  (CODE  (ZllKCL  ASS  (EPOINT)  )  THEN 
LISTENO  :=  EPOINT; 

I F ( CODE ( Z  |  1  )>CLASS ( EPOINT )  )THEN 
LISTBEGIN  :=  EPOINT; 

EPOINT  :=(LISTEND+LISTBEGIN)DIV  2 
END; 

ENDCNE (EPOINT)  :=  ENDONE ( EPOI NT ) + 1 
END; 

END  ENOSOUNDONE; 


PROCEDURE  ENDSOUNDTWO  ; 

BEGIN 

INTEGER  LISTENDtLlSTBEGIN; 

STR  I NG ( 2 )  SOUND; 

I  F ( Z>=  5 ) THEN 
BEGIN 

E_1C0UNT  :=  E  1COUNT  +  I; 

LISTBEGIN  :=  FP0INT*5-4; 

LISTEND  :=  EPOINT  ;=  L I STBE  GIN+4 ; 

SOUND ( 0  I  1) :=CODE(Z| 1) ; SOUND ( 1  ID  :=C0DE( Z-ll 1) 
WH I  LS(  S  OUND  (  0  |  2)-'=SOUNDTWO(EPOINT  )  )  DC 
BEGIN 

IF ( SOUN D( 0 1  2) <SO UN DTWO(E POINT) )  THEN 
LISTENO  :=  EPOINT; 

IF( SOUND(Ol 2) >SOUNDTWO( EPOINT) )  THEN 
LISTBEGIN  :=  EPOINT; 

EPOINT  :=(LISTEND+LISTBEGIN)DIV  2 
END  ; 

ENDTWOSTAT( EPOINT) : =ENDTWOST AT ( EPO INT ) +1 
END; 

END  ENDSOUNDTWO; 
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PR0C5DUFE  printone; 

BEGIN 

IOCGNTRCL  (B); 

WRITE  ("PERCENTAGE  OF  WORDS  WITH  FIRST  SCUND  AS  NOTEO."); 
WRITE!"  "I; 

WRI TE( "NUMBER  OF  WORDS  IN  SAMPLE  OF  STATED  LENGTH  IS  :"); 
WRITE(Q) ; 

WRITE!"  '•); 

FOR  I  5=1  UNTIL  HEADERCOUNT  DO 
WRITE  (HEADER! I)); 

WRITE!"  "); 

FOR  I  : =  1  UNTIL  5  DO 

WRITECN  ("  ''.CLASS  (  I  ) » "  ")  ; 

IOCONTPOL  (2); 

WRITE!"  "); 

IOCONTPOL  (2); 

FOR  J  :=  1  UNTIL  5  DO 
BEGIN 

IF  ( SOUNDON  E ( J )  =  0  ) THEN 
WRI TEON (S CUN DONE ( J ) ) 

ELSE 

BEGIN 

S  OUNDCNE ( J )  :=  SOU NOON E ( J )/Q*IOO ; 

STATMUX1  (N,J)  :=  SOUNDONE(J); 

WRITEON(SOUNDONE(J ) ) 

END; 

END  ; 

END  PRINTONE; 
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PROCEDURE  PRINTTWO; 

BEGIN 
INTEGER  P; 

I3C0NTR0L  (3J; 

WRITE!"  PERCENTAGE  OF  WORDS  LONGER  THAN  ONE  SOUND  WITH 
BEGINING  TWO-SOUND  COMBINATIONS  AS  NOTED."); 

WRITE!"  ")*, 

WRI TE( "NUMBER  OF  WORDS  IN  SAMPLE  OF  STATED  LENGTH  IS  :»); 
WRITE(TWOCOUNT) ; 

WRITE!"  "); 

FOR  I  :  =  I  UNTIL  HEADERCO'JNT  DO 
WRITE  ( HEADER ( I ) ) ; 

WRITE! "  "); 

FOR  K  :=  1  UNTIL  5  DO 
BEGIN 

IOC3NTRCL  (2); 

FOR  L  :=  4  STEP  -I  UNTIL  0  DO 
BEGIN 

WR I T  EON ( "  ", SOUNDTWO(K*5-L)  , "  "); 

END ; 

IOCONTROL  ( 2)  ; 

WRITE!"  "); 

IOCONTROL  (2); 

FOR  M  :=  4  STEP  -I  UNTIL  0  DO 
BEGIN 

p  •  —  J$c  -  — • 

IF'(SOUNDTWOSTAT(P)  =  0)  THEN 
WRI T  EON ( SOUNDTWOST  AT ( ° )  ) 

ELSE 

BEGIN 

SOUNDTWOST AT ( P)  :=  (SOUNDTWOST AT ( P)/TWCCCUNT ) *  100  ; 

VECT  WO  (  5-M )  :=  VEC  TWO  (  5-M )  SOUNDTWOST  AT  (  P  )  ; 

WR I T EON ( SOUNDTWOST AT (P ) ) 

END; 

END  ; 

END; 

WRITE!"  "); 

WRITE  ("  ")  ; 

IOCONTROL  (2)  ; 

WRITE! "ANALYSIS  OF  SECOND  SOUND  AS  THE  PERCENTAGE  OF"); 
WRI TE ("OC CURA NCE  IN  THE  SECOND  POSITION."); 

WRITE!"  "); 

IOCONTROL  (2)  ; 

FOR  J  :=  1  UNTIL  5  DO 
WRI T  EON ( VECTWO ( J) ) 

END  PRINTTWO; 
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PROCEDURE  PRINTTHREE; 

BEGIN 

INTEGER  P; 

IOCDNTROL  (3); 

WRITE!"  PERCENTAGE  OF  WORDS  LONGER  THAN  TWO  SOUNDS  WITH 
BEGINING  THREE-SOUND  COMBINATIONS  AS  NOTED."); 

WRITE!"  "); 

WRITE! "NUMBER  OF  WORDS  IN  SAMPLE  OF  STATED  LENGTH  IS  :") 
WRITE(THREECOUNT); 

WRITE!" 

FOR  I  :=  1  UNTIL  HEADERCOUMT  DO 
WRITE  ! HEADER ( I ) ) ; 

WRITE!"  " ) ; 

FOR  I  :=  1  UNTIL  25  DO 
BEGIN 

IOCDNTROL  (2); 

FOR  J  :=  4  STEP  -1  UNTIL  0  DO 

WRITEON!"  ", SOUNDTHREE! I*5-J) , "  "); 

IOCONTRCL  (2) ; 

WRITE!  "  ")  ; 

IOCDNTROL  (2) ; 

FOR  M  ;=  4  STEP  -1  UNTIL  0  00 
BEGIN 

P  ; =  I *5-m * 

IF! SOUNCTHREESTAT!  P)  =  0)  THEN 
WRITECN(S0UNCTHREESTAT( P ) ) 

ELSE 

BEGIN 

SOUNCTHREESTAT! P )  :=  (SOUNDTHREESTAT(P) 

/THREE COUNT ) *  100 ; 

VECTHREE ( 5-M)  :=  VECTHREE l 5-M )  +  SOUNDTHREEST AT ( P  ) ; 

WRITEON (SOUNDT HR EEST AT( P ) ) 

END; 

END; 

IOCDNTROL  (2); 

WRITE!"  ") 

END; 

WRITE! »  ") ; 

WRITE!"  ") ; 

IOCDNTROL  (2)  ; 

WRITEC'ANALYS IS  OF  THIRD  SOUND  AS  TO  THE  PERCENTAGE  OF") 
WRITEC'OCCURANCE  IN  THE  THIRD  POSITION."); 

WRITE!"  »); 

IOCONTROL  (2)  ; 

FOR  K  :=  1  UNTIL  5  DO 
WRITEON  (VECTHREE! K) ) 

END  PRINTTHRFE; 
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PROCEDURE  PRINTFOUR; 

BEGIN 
INTEGER  P; 

IOCONTROL  (3); 

WRITE! "PERCENTAGE  OF  WORDS  LONGER  THAN  THREE  SOUNDS  WITH 
BEGINING  FOUR-SOUND  COMBINATIONS  AS  NOTED."); 

WR I T  E  ! "  " ) * 

WRI T£< "NUMBER  OF  WORDS  IN  SAMPLE  CF  STATED  LENGTH  IS  :"); 
WRITE! FCURCOUNT  )  ; 

WRITE!"  " ) * 

FOR  I  :=  l’uNTIL  HEADERCOUNT  DO 
WRITE  ( HEADER ( I ) ) ; 

ypTTS  (n  ii  \  • 

FOR  I  :=  I  UNTIL  125  DO 
BEGIN  IOCONTROL  (2); 

FOR  M  :=  4  STEP  -1  UNTIL  0  DO 
BEGIN 

P  :  =  I  *  5  -  M ; 

IF  ( SOUNDFOURSTAT  (P)  =0  )  THEN 

ELSE 

BEGIN 

SOUNDFOURSTAT  (P)  :=  ( SOUNDFOURST AT  (P)  /  FOURCOUNT  )  *  100 

WRITE! SOUNDFOUR! P ) , SOUNDcOU RSTAT ( P ) ) ; 

WRITE!"  ") 

END; 

END; 

END; 

END  PRINTFOUR; 


PROCEDURE  PRINTSND; 

BEGIN 

IOCONTROL  ( 3) ; 

WRITE  <"  PERCENTAGE  OF  WORDS  WITH  LAST  SOUND  AS  NOTED. " )  ; 
WRITE!"  "); 

WRITE! "NUMBER  OF  WORDS  IN  SAMPLE  CF  STATED  LENGTH  IS  :"); 
WRITE! ECOUNT ) ; 

WRI Tc( "  " ) ' 

FOR  I  :=  l’u.NTIL  HEADERCOUNT  00 
WRITE! HEADER! I ) ) ; 

WRITE!"  ")*, 

FOR  I  :=  1  UNTIL  5  DO 

WRIT EON ( "  ",  CLASS (I),"  "); 

IOCONTROL  (2); 

WRITE!"  "); 

IOCONTROL  (2); 

FOR  J  :=  1  UNTIL  5  DO 
BEGIN 

IF! ENDCNE! J )=0 )  THEN 
WRI TEC N ( ENDONE ( J ) ) 

ELSE 

BEGIN 

ENDONE ( J )  :=  ENDONE ( J )/ ECCUNT*100 
WRI TECN ( ENDON  E { J ) ) 

END; 

END; 

END  PR  INTEND; 


146 


PROCEDURE  PRINTENDTWO; 

BEGIN 
INTEGER  P; 

I OCONT  POL  (3); 

WRITE!"  PERCENTAGE  OF  WORDS  LONGER  THAN  ONE  SOUND  WITH 
ENDING  TWO  SOUND  COMBINATIONS  AS  NOTED."); 

WRITE!"  "); 

WRI TE( "NUMBER  OF  WORDS  IN  SAMPLE  OF  STATED  LENGTH  IS 
WRITE! E.1COUNT ) ; 

WRITE!"  "); 

FOR  I  :=  1  UNTIL  HEADERCOUNT  DO 
WRITE ( F  EADER (  I )  ) ; 

WRITE!"  "); 

FOR  K  :=  1  UNTIL  5 
BEGIN 

I OCCNT  POL  (2); 

FOR  L  : =  4  STEP  -1 
BEGIN 
WRITECN (" 

END; 

IOCGNTROL  (2); 

WRITE!" 

IOCONTRCL  (2)  ; 

FOR  M  :=  4  STEP  -1 
BEGIN 

P  :=  K*5-M; 

IF  ( ENDTWOST  AT  (  0 
WRITEON!  ENDTWC'STAT 
ELSE 
BEGIN 

ENDTWOST  AT ( P )  :=  ( ENDT WO  ST AT ( P ) / E  1C0UNT)*10D; 

ENOVEC2 ( 5-M ) : =ENDVEC2( 5-M ) +E NDTWOSTAT ( P ) ; 

WPITEON!ENDTWOST  AT  {  P ) ) 

END; 

END; 

END; 

WRITE!"  "); 

WRI T E ( "ANALYS IS  Op  NEXT  TO  LAST  SOUND  AS  tq  THE  PERCENTAGE") 
WRI T E( "OF  OCCURANCE  IN  THAT  POSITION."); 

WRITEC  "); 

I OCONT  POL  ! 2  )  ; 

FOP  J  :=  1  UNTIL  5  DO 
WRITEON! ENDVEC2! J)  ) 

END  PR  I NT ENDT WO  ; 


DO 

UNTIL  0  DO 

'SSOUNDTWO!  K*5-L)  ,  "  "); 

UNTIL  0  DO 

)  =  0)  THEN 
(P  )  ) 
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PRINTMUX1 ; 

1  UNTIL  5  DO 

1  UNTIL  N  DO 
STATMUX1(R,S) 


=  1 


(3) ; 

(2) ; 

PROBABILITY 


PROCEDURE 
BEGIN 
FOR  S  := 

BEGIN 
B  :  =  0  ; 

FOR  R  := 

BEGIN 
B  :=  B 
END; 

FOR  T 
BEGIN 
PMUX1 ( S  »T) 

END; 

END; 

WRITEC'  ") 

WRI T  E ( "  ") 

IOCONTPOl 
IOCONTROL 
WRITE  <" 

WRITE  ( "  ; 

IOCONTROL  (2) 

FOR  I  :=  1 
BEGIN 

WRIT  EON  (" 

END; 

IOCONTROL 
WRITE  ("  " 

IOCONTROL 
FOR  K  :=  1 
BEGIN 
IOCONTROL 
WRITE  ("  " 

WRIT?  (••  •• 

IOCONTROL 
FOR  J  ; =  1 
BEGIN 

WRITECN  (PMUXKJtKM 
END; 

END; 

END  PRINTMUXl; 


UNTIL  N  DO 

:=  ( STATMUX1 ( T»  S ) /B)  5 


MATRIX  FOR  FIRST  SOUNDS 


" ) ; 


UNTIL  5  DO 


CLASS  ( I  ) 


( 2)  ; 

) ; 

(2)  ; 
UNTIL 

(2)  ; 

) ; 

) ; 

(2) ; 
UNTIL 


5  DO 


N  DO 
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=  1  UNTIL  5  DO 


PROCEDURE  CLEARMEM; 
3EGIN  „  T 

INTEGER  Pt  S,  T ! 

P  :  =  0 
S  :=  0 
T  :=  0 
FOR  I 
BEGIN 

VECTWO(I)  :=  0  ; 
VECTHR EE ( I )  :=  0  ; 

vec four  (i)  •*=  0; 

ENCVEC2  ( I )  *•=  0; 
ENDGNE (I )  :■  C; 
SOUNCONE  (I)  •*=  0; 

FOR  J  :=  1  UNTIL  5  DO 
BEGIN 

P  :=  P  +  l; 

ENDTWCST AT  ( P )  :=  0; 


SOUNDTV.CSTAT  |P) 
FOR  K  :  =  1  UN  i  I L  5 
BEGIN 

S  :=  S  +  l; 
SOUNDTHREESTAT  (S) 
FOR  L  : =  I  UNTIL  5 
BEGIN 

T  :  =  T  +  1; 
SOUNDFOURST AT  (T) 
END; 

END  ; 

END? 

END; 

END  CLEARMEM; 


0 

DO 


:=  0 
DO 


:=  0 
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CLASS  (1)  :=  "F" ; 

CLASS  (2)  :=  "N" ; 

CLASS  (3)  :=  "P"; 

CLASS  (4)  :=  "S"; 

CLASS  (5)  :=  "V"; 
GENERATETWO; 
GENERATETHPEE; 
GENERATEFCUR; 

GETARRAY; 

N  :  =  1 ; 

WHILE  (  N  >  0)  DO 
BEGIN 

IOCONTRCL  (3); 

INITIALIZE; 

CLEARMEM; 

WHILE  (GETWORD  (WORD))  DO 
BEGIN 

CONVERT  (CODE); 

ONESOUND; 

TWO  SOUND; 

threescund; 

FOURSOUNO; 

ENDSOUNDGNE; 

ENDSOUNDTWO 

END; 

PRI NTONE  ; 

PRINTTWO; 

PRINTTHREE; 

PRINTFOUR; 

PRINTENC; 

PRINTENDTWO; 

READ  (N) 

END  ; 

N  :  =  5; 

PRI NTMUX1 
END. 


COM  M  ENT***************  *  **************  *  ******  *****  *$****#***:!: 
ROMI  MUST  ALWAYS  BE  THE  FIRST  DATA  SET  FOLLOWING  THE  "SDATA'1 
CARD  (JCL). 

NOTE  THAT  ROMI  IS  WRITTEN  SUCH  THAT  EACH  CHARACTER  OF  ROMI 
IS  FOLLOWED  IMMEDIATELY  (SEPARATED  BY  ONE  SPACE)  BY  THE 
CATEGORY  TO  WHICH  IT  BELONGS.  TWO  LETTERS  JOINED  BY  A  "-» 
ARE  READ  AS  A  SINGLE  LETTER. 

***********  ****  *** * ** ****  **  **  ** ** ****** *** *** *  ** * ** **** * ** *  ; 
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#  ajc  *£  #  ^c^e  3*:*  #  :$c  $:£  *  #± 

COMPUTER  program  NUMBER  2 


//COOK8  JOB  { 0405, 1404, MA52  ) « 1  SMC  2737'tTIMF=E 
//  EXEC  ALG0LW,REGI0N=350K 

// SYSPR INT  OD  SYSPUT=A,DCB=BLKSIZF=665,SPACE=(CYL,(3,1) ) 
//ALGOL. SYSIN  DO  * 

*ALGOL  T=300,P=?00 
SNOLIST 


BEGIN 

COMMFNT 

*************  **  ***  *********  ******  **  **  *  *******  ******  *****  **** 
*  **  ***  *  **  **********  ***************************  **  *  *  *  *  ****** 


FINA 

L  PROGRAM 

OUTLINF: 

1  . 

DIMENSION 

AREA 

2. 

PROCEDURE 

INITIALIZE 

3. 

«f 

SFTCOUNT 

4. 

ii 

GENERATETWO 

5. 

it 

GENERATETHR=F 

6. 

ii 

GFNERATEFOUR 

7. 

it 

GENERATRIX 

8. 

ii 

GpT  ARRAY 

9. 

LOGICAL  PROCEDURE  GFTW^rd 

10. 

PROCEDURE 

CONVERT 

1  1  . 

n 

PRog  TWO 

12. 

ii 

DEC  IDE 

(INCLUDES  PATHS 

FOR 

1  >  2  »  3  *4,  5?  6  . 

SOUND 

WD  PDC ) 

13. 

if 

COMPUTE  NORM 

14. 

ii 

RAW  ANALYSIS 

(INCLUDES  PATHS 

POQ 

1 ,2,3,4, 5,6, 

SOUND 

WORDS) 

15' 

ii 

COMPJT=  ANY 

16 

ii 

CLPARM=M 

17 

ii 

FINDSOUNO 

18 

it 

ONF  SOUND 

19 

ii 

GRAPH 

20. 

MAIN  PROGRAM 

***************  ******  ****************  ********  *  **  *  *****  ****** 
*  A****#  ****  **********************  ************  *************** 
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COMMF\'T  INPUT  DATA  WRITTEN  IN  "RON  I"  WITH  AT  LEAST  ONE:  BLANK 
BETWEEN  WORDS. 

WOcO  *UST  START  AND  END  ON  SAME  CARD.; 


: 101 . 1 : : 2  ) ; 
3  ) ; 


I NT EGER  DIM; 

INTEGER  SAMPLES  I Zc ; 

PROCEDURE  DUMMY; 

BCG  I N 

COMMENT  *****  GLOBAL  DECLARATIONS; 

INTEGER  TWOPD INT ,nNEPO!NT?SPACF; 

INTEGER  INDEX  ,  POINTER,  Z,TP,  yo,  HEA  DCOUNT , 
INTEGFR  FOURPOINT; 

integer  threepoint; 

INTEGER  N; 

STRING  (PO)  T  CX  T ; 

STRING  (36)  WOR O ,  CODF; 

LOGICAL  FOUND,  BAD; 

STRING  (?)  ARRAY  ROw I  (  l 
STP  I NG ( 6 )  ARRAY  CLASS(9: 

STRING (6)  ARRAY  SOUND T WQ ( 0 : ; 1 5 ) ; 

STRING  ( 6  )  ARRAY  SOUNDTH.R  FE  (  0  ;  :  63  )  ; 

ST  R I NG (6 )  AR°  AY  SOUNDFO'JR ( 0 : ; ? 55 ) ; 

STRING (6)  ARRAY  SOUND F I Vc ( 0 ; : l 02 3 ) ; 

S  TR I NG ( 6 )  ARRAY  SOUNDS  I  X ( 0 :: 4095 )  ; 

STRING  (80)  ARRAY  HEAonp  ( J ; ; \ 0 ) ; 

REAL  ARRAY  0NF1  ,  ONE  2 ,  0NC3  ,  OME4,  ONES  ,  0NE6  ,  F6 
REAL  ARRAY  0NFTWO2 , 0NETWO3 , 0NETW04, 0NCTW05 (0 
REAL  ARRAY  ONET  WO* ,  =  I  R  S  TL  A  S  T5  ,  F  !  R  STL  A  S  T6  (  0  :  ; 
REAL  ARRAY  Ec26(  0:  ;  1  6, 0;  ;DI  m  ) ; 

?EAL  ARRAY  0NETW033 .0NETW034 ,0NETW035 , 

ONFT  W03  6(0; ;64,0; :DIM) ; 

REAL  ARRAY  ON  c=T  W03  4  4 ,  D  Nc  TWO  345  ,  QNF  T  WO  346  , 

El 2EE26( 0: : 256,0 : :DI ”) ; 

R -  A  L  ARRAY  0NETWC3^E6,0NCTW034E5( 0: ;  1024,0: ; 
REAL  ARRAY  A L L SOUND ( 0 : : 4 095 , 0 : ; D I M ) ; 


F POINT,  NUMBER: 


(  0  :  :  4  ,  o  :  :  n  T  M  ) : 
:  :  1 6  •  0 :  :  o  I  M  )  : 
l  6 , 0 :  :  D  I M )  ; 


DIM); 
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COMMENT  *************  *  ****  *******************  ************** 

PROCEDURE  INITIALIZE  RESETS  THE  VALUES  OF  TH=  GLOBAL 
COUNTE RS  USED  THROUGHOUT  THE  PROGRAM.  INITIALTZE  ALSn  READS 
AMO  PRINTS  THE  DATA  SET  IDENTIFIER  "HEADER,"  S  CTT  TNG 
"TPXT  POINTER,"  "TP,"  to  GO  CAUSES  T HE  PROCEDURE  GETWORD 
TO  READ  THE  F  P  ST  DATA  CAPO. 

***  ****  ***************  *****  **  **  ******  *******  **************** 
C OM MFNT *** ******** **** ****** * 

HEAOEPCOUNT  IS  THc  NUMBER  OF  CARDS  PRECEEDING  THP  DATA 
WHICH  DESCRIBE  THE  TYPE / QU AL I TY / ETC .  OF  THE  DATA  BUT  APF 
MOT  td  qe  READ  AS  PART  OF  THE  DATA. 
***************************** • 

PROCEDURE  INITIALIZE: 

BEGIN 
INDEX: =0; 

READ  (HEADCOUNT): 

FOP  I  :=  1  UNTIL  HEADCOUNT  DO 
BEGIN 

HEADER ( I )  :  =  "  "  : 

RE ADCAP Dt  HE  ACER (  I)  ): 

END: 

I OCONTROL  (2): 
writfc  •’): 

IOCONTROL  (2): 

TP  :=  80 

END  initialize: 


PROCEDURE  SETCOUNT; 
BEGIN 

Fno  i  j  UNTIL  DIM  DO 

BEGIN 

ONE  1 (4,1) :  =  0 : 

0NC2 (4,  I ) :=C: 

0NC  3(4,1 ) :  =  0: 

0NE4<  4 , I ) :  =  0 : 

ONE  5(4,1 ) :=0: 

0NC  6(4,1 ) :  =  0: 

E6(  4,1  )  :  =0: 

0Ne  TWO  2(16,1 )  : =0 : 

ONE  TW03 (16,1)  :  =  0 : 

ONE  TW04 (16,1)  :=o: 
0NETWO5( 16, I ) :»0; 
0NETW06( 16,1 ) : =0 ; 

FIR  STL  A  ST5 ( 16,11  :  =  0: 

F I  °  STL  A  ST6  (  16,1)  :  =  0: 

EE  26 ( 1 6 , 1 ) : =0  ; 
0NETW03^(64,I  1 :  =0: 
ONETW034 ( 64 , I ) :=0; 
0NETW035( 64,1 ) :=0: 

0NPT WO  3  6 ( 64 , 1  ) : =0: 

ONET WO 3 44(256,1 ) :=0; 

ON FT WO 345(256, 1  )  :=0: 
0NETW0346( 256, I ) :=0? 

F 1 2  EE2  6(  256,1  ):  =  0; 

ONET W034E6( 1024,1) :=0: 
0NETW034E5 ( 1024. I ) :=0; 
ALLS0UND(4096 ,1  )  :=0: 
END: 

END  SETCOUNT; 
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COMMENT  ******  ************************  *******  ************* 
°ROC  EDUR F  GENE^ATCTWO  °ROV!DES  THR  16  TWO-SOUMD  CLASS  LA&RL 
OF  S I  GNAT  IONS  IN  ALPHABETICAL  ORDER.  THESE  L  A  B  CL  S  APC 
STORED  IN  THE  ONE  DIMENSION  ARRAY  "SOUNDTWO."  THE  LABELS 
CORRESPOND  TO  the  IS  ROWS  OF  THE  SCT  OF  REAL  ARRAYS  NAMEO 
"ON=TWO_" ,"FIRSTLAST_” ,  AMD  "RR26." 

*************  ******  ******************************  ********** 

PROCEDURE  GEN Ep  ATC TWO : 

BEGIN 
INTEGER  Y ' 

Y  •  —  0  * 

FOP  I  :=  0  UNTIL  3  DO 
BEGIN 

ROR  J  :=  0  UNTIL  3  DO 
BEGIN 

SCUNDTWO(Y)  :=  '• 

SOUNDTWO  { Y )  (Oil)  :  =  CLASS  (IHOjl); 

SOUNDTWO  ( Y ) (  1 !  1 )  :=  CLASS  (J)(o|l); 

Y  :=  Y  *  1 
END  I 

END; 

END  GEMFRATETWO; 


COMMENT  ******************  ******************************** 
PROCEDURE  G  EM  F RAT  ET  HR  E  c  TAKES  THE  IS  ELFM^NT  AR  R  A v 
"S  PUNPTWO"  AND  GENERATES  THF  64  POSSIBLE  COMBINATIONS  Oc 
THE  FIRST  THREF  SOUNDS  OF  A  WORD  IN  ALPHABETICAL  ORD^p. 

THF  64  ELEMENTS  ARE  STORED  IN  THE  ONE  DIMENSIONAL  AD  D  A  Y 
"SOIJNDTHREE"  AND  APE  USED  AS  LABELS  FOP  THR  64  ROWS  OF  THF 
SET  OF  REAL  ARRAYS  "QNETW03 

***************************  £******************  ************* 


PROCEDURE  GENEPATRTHRFC; 

BEGIN 
INTEGER  Y; 

Y  :  =  o  * 

FOR  I  : =  0  UNTIL  1 5  DO 
BEGIN 

cOR  J  :=  0  UNTIL  3  DO 
BEGIN 

SOUNDTHREE ( Y)  :=  " 

SOUNDTHPEE(Y) (0 !?>  :  =  S OUNDT W0( I )  ( 0  !  2 )  : 

SOUNDTHREE! Y)  ( 2  1 1 >  :  =  CLASS ( J ) O I  1 ) : 

Y  :=  YM 
END  ; 

END; 

END  GENERATETHREE; 


PROCEDURE  GENERATEFDUR : 

BEGIN 
INTEGER  y; 

Y  :  =o  * 

FOR  I* : =  0  UNTIL  63  DO 
BCG  I  N 

C0R  J  :=  0  UNTIL  3  DO 
BEGIN 

SOUNDFOUR(Y)  :  = 

SOUNOFOURIY)  ( 0  I  3 )  :  =  SpUNOTHRcc(I ) ( 0  I  3  >  ; 

SOUNDROUR(Y) (3ll):=CLASS(J)(0|l»; 

Y  :=  Y  + 1 
END  : 

END; 

END  GENERATEP01JP; 
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PROCEDURE  GEMER  ATf$ IX; 

8CG  I N 

I NTcGpR  X,Y; 

Y  :=  0: 

X  •  —  -  1  j 

FOP  I :  =  6  UNTIL  255  DO 
BEGIN 

FOR  J  : =  0  UNTIL  *  DO 
BEGIN 
X  •  - X+ 1  ; 

SOUNDFI VF(X) : =" 

SOUNDF IVE( X) (014): =SOUNDFOUR(I  )  (0  I  41  ; 

SOUNDF I VE ( X ) (  4  I  1  ) : =CL A  SS (J)  (0|1) ; 

FOP  K  :=  0  UNTIL  3  DO 
BEGIN 

SOUNDSI  X(  Y)  :  =  "  "; 

SOUNDS  IX (Y)  (015)  :=SOUNDFIVE (X) ( 0  I  5)  ; 

SOUNDS  IX (Y) ( 5  I  1 ) :=CLASS(K  )  (0  I  1  )  ; 

Y :  =Y  +  1 
END; 

END  ; 

END; 

END  GENERATESIX; 

£QHMPNT  **  #***  ******  ******  **$****#**:•:  *****$  £*£**£*:!!$  *  £:£**** 
PROCEDURE  GETARRAY  READS  IN  th  =  101  CHARACTERS  USED  BY  thi S 
PROGRAM  TO  RE  PRE  SCNT  the  INTERNATIONAL  PHONETIC  AL  °HAB  FT  . 
EACH  SOUND  REPRESENTATION  IS  PLACED  IN  A  TWO  COLUMN  ARRAY 
"ROM  I . "  COLUMN  ONE  IS  thc  SOUND  REPRESENTATION  AND  COLUMN 
TWO  CONTAINS  THE  CLASS:  EITHFp  NASAL,  FPICATIVF,  ST"Pf  OR 
VOWEL.  THE  NUMBER  OF  CARDS  USED  TO  READ  IN  "ROMI"  CANNOT 
BE  VARIED  WITHOUT  A  L  TE  R I NG  THE  ORCGPAM.  THE  NUMBER  OF 
ELEMENTS  IN  "ROMI"  IS  NOT  VARIABLE.  "PQMT"  IS  PRINTED  OUT 
A  CARD  AT  A  TIME,  JUST  AS  IT  T S  TYP=D  IN  TH=  OATA  CARDS. 


RROC  FOUR  F  GETARRAY; 

BEGIN 

STRING (81)  PHONOL: 

INTEGER  B, A ; 

B :  =  1 ; 

A  :  =  I : 

WHILE  (A  <=  27)  DO 
BCGIN 

PHONCL  :=  "  "; 

READCARD  (PHONOL); 

Tp  »  =0* 

WHILF(TP  <=  7 9 ) DO 
BEGIN 

WH I L c  (PHONOL  ( TP  I  1 )  ="  ")  AND  (TP<=79)  DO 
TR:=  TR+l; 

IF  (  Tp  80  )  THEN 
BEGIN 

IF  (PHONOL  (TP  +  1  |  1  )-•  =  "-"  )  THEN 
BEGIN 

ROMI  ( B  ,  1 ) :  =  PHONCL  ( TP  1 1 )  ; 

T°:  =  TP  +  R; 


END 
EL  Sc 
BEGIN 
ROMI 
ROMI 


( B,  1) (01  1) 
( e ,  i )  ( 1 1 1 ) 


RHONOL  ( T  P | 1 )  ; 
PHONOL  ( TP  +  2 I  1 )  ; 


TP: =TP+4; 

END; 

ROMI  ( e , 2 ) t  =  PHONOL  ( Tp  |  1)  ; 


B:  =  8*1  ; 


TP  :=  TP+1 

END 

ELSE 


A  :=  A  +  1 ; 
END 
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END ; 

TP  :=  80 
END  GET ARP  AY: 
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COMMENT  *************************************************** 
LOGICAL  PROC  FDURC  G  F  T  W  0  R  0  READS  THE  INPUT  DATA  WP  ITtfN  GM 
"RDM  I "  TNTO  ft  BUFFER  CALLED  "TFXT,"  ONc  CARD  AT  ft  TIME  AMD 
FCHO  PRINTS  EACH  CARD  WITH  A  SE  DUE  NT  I AL  NUMBER  IDENTIcter 
"INDEX."  THE  BUFFC5,  "TEXT,"  IS  SCANNED  ONE  CHARACTER  AT 
ft  TIME  UNTIL  A  "WDPD"  IS  BUILT.  LEADING  AND  TRAILING 
BLANKS  DELIMIT  ft  "WORD."  "WORD"  IS  THE  GLOBAL  P  ARAM"TE  R 
OF  STRING  TYPE  Oe  MAXIMUM  LENGTH  Oc  36  CHARA^TPRS  WMIfH 
IS  DELIVERED  TO  THE  MAIN  PPOGPAM  AMD  SUPSEOUcNT  PROCEDURE. 
WHEN  THE  CHARACTER  »*"  IS  ENCOUNTERED  ON  ft  DATA  CARD  IN 
OTH=R  THAN  CARD  COLJMN  1  THE  LOGICAL  VALUE  OF  "GETWORD"  IS 
SET  TO  f&lSE  THUS  SIGNALLING  THE  END  OF  A  DftTA  SET. 

"WORDS"  MUST  START  AND  END  ON  THE  S AM c  CARD. 

*******  ***************  ************************************** 

LOGICAL  PROCEDURE  GETWORD  <STRING(36)  RESULT  WORD); 

BEGIN 
WP  :=  0; 

IF(TP=bO)  THEN 
BEGIN 

INDEX: =INDEX*1 ; 

RE  ADC  Ap  D  (  TEXT)  ; 

TP:  =0 
END; 

WHILp(TEXT(TP I  1 )="  ")  DO 

BEGIN 

To; =tp+i ; 

IF  ( TP  =  P  0 )  THEN 
BEGIN 

RE  ADC A°  D  ( TFXT) ; 

INDE  X:  =  I NDE  X+ 1 ; 

TP :  =0 

END 

END; 

W  QP  D :  =  "  " ; 

WHILE  (  TP<80  )  AND  (  TEXT  (  TP  I  l  )-.  =  «  ")  DO 
BEGIN 

WORD  (WP| 1) :=  T EXT ( Tp | 1  ); 

WP  :=  WP  +  1; 

To  :=  TP  +  I 
END; 

WP  : =  WP-1; 

WORD-*=,,<?" 

END  GETWORD; 


COMMENT  *************  ***************5 *********************** 
PROCEDUPF  "CONVERT"  OPERATES  ON  T HE  GLOBAL  VA3IABLC  "WORD" 
OBTAINEO  FROM  "GETWOPO."  "WORD"  IS  SCANNED  CHARACTER  BY 
CHARACTER  TO  OBTAIN  EACH  "ROM!"  SOUND  RFP°cSFNT&t ION. 

A  BINARY  SEARCH  OP  THE  ARRAY  "POMI"  IS  CONDUCTED  T0 
OBTAIN  THE  SOUND  CLASS  EQUIVALENT  FOR  EACH  "SOUND"  DP 
"WORD."  A  NEW  STRING,  "CODE,"  IS  GENERATED  f ROM  THE  POUR 
SOUND  DESIGNATIONS  CORRESPONDING  to  THE  INTERNATIONAL 
PHONETIC  ALPHABET  SPELLING  OF  "WORD."  "CONVERT"  IS  ALSO 
CAPABLE  Op  EDITING  INPUT  data  An'Q  PRINTS  OUT  A  WARNING 
LABEL  ALONG  WITH  the  PARTICULAR  "SOUND"  P  =  PR  E  R  PNT  AT  I  n\« 

WHICH  DID  NOT  APPPAR  IN  "RQMI"  AND  THR  "WHRP"  IM  WHICH  THE 
ERROR  OCCURRED.  A  SAMPLE  OUTPUT  PROM  "CONVcRT"  WOULD  BE 
"FVNSFVNN."  THc  COUNTER  " Z"  IS  ALSO  A  GLOBAL  VARTA8Lc, 

AND  TELLS  THE  NUMBER  OF  IPA  SOUNDS  IN  EACH  CTPING  "CODE." 

"Z"  IS  USED  TO  DRIVE  LATER  PORTIONS  OF  THE  PROGRAM. 

IF  A  MISTAKE  IS  MADE  nN  A  DATA  CA»D  SUCH  THAT  A  CHARACTER 
STRING  IS  FOUND  that  IS  NOT  A  MEMBER  OF  POMI,  THE  "WORD" 
CONTAINING  THC  UNKNOWN  "SOUND"  IS  PRINTED  OUT  ALONG  WITH 
AN  ERROR  MESSAGE.  WHEN  THIS  CONDITION  ARTSPS.  "BAD"  IS  SET 
T0  FALSE  AND  NO  ANALYSIS  IS  DON<=  ON  th=  "CODE"  BY  THE 
remainder  OF  the  program. 

******  *  ******  *******  **  *****  ******************  *********  A****; 

PROCEDURE  CONVERT  ( STR I NG (36)  RESULT  CODE  I : 

BEGIN 

STRING (2)  SOUND; 

INTEGFR  I, TEST; 

INTEGER  LISTENC,  LISTBEGIN,  POINTER; 

COPE  :=  "  "; 

BAD  :  =  TRUE; 

I  :=  0; 

Z  :  =  0; 

WHILE  I<=WP  DO 

BEGIN 

TEST  :=  o; 

FOUND  :=  trup: 

SOUND: ="  »; 

IF  WORD  ( 1+1 ! 1 )-  =  THEN 

BEGIN 

SOUND  :  =  WORD  (III); 

I:  =  1+1;  END 

ELSE 

BEGIN 

SOUND ( 0  I  1 ) : =  WORD( III); 

SOUND (  111)  :  =  WORD(  H-2|  1  )  ; 

I:  =  1+3 
END ; 

L I STBEGIN: =  1; 

LISTEND  :=  101; 

POINTER  :=  LISTEMP; 

WHILE  (  FOUND  )  Dn 
B  CG  I  N 

TEST  :=  TEST+1; 

IF  SOUND  <  POMI (POINTER t  1)  THEN 
L I STEND : =  POINTFR; 

IE  SOUND  >  ROMI ( POINTER, 1)  THEN 
L I STBEGIN: =  POINT<=p: 

Ic  (  SOUND  =  ROMI  (  POINTER, 1  ))THFN 

FOUND  :=  FALSE 

ELSE 

8FGIN 

IF  (  TEST  =  12  )  tmcvj 
BPGIN 

WR  I TE ( INDEX, T=XT ) : 

WRITP("ERONEOUS  DATA",  "WORD  =  ",  WORD,  "SOUND  =  ", POUND); 
FOUND  :=  FALSE; 

I  : =  W°  +  l ; 

BAD  :=  FALSE 
ENP; 

POINTER  :=  TRUNCATE  (  (  LISTEND  +  LISTBEGIN  )  /  2  ) 
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END ; 

END; 

CODE  (  Z|1  )  :=  POM  I  (  POTNTPR,  ?  )  (oil  ); 
Z  :=  Z+l  : 

IF  ( BAD  =  FAL5F)  thSN 

Z  ;=  1 

END; 

Z  ;=  Z-l 
END  CONVERT; 
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COMMEN"1’****^*^*******^^*******^1^*************************** 

PROCEDURE  PROB.TWD  CALCULATES  THE  PROBABILISTIC  PROGRESSIONS 
<PP'$).  THE  PP'St  WHEN  CALCULATED,  ARE  REASSIGNED  TO  THE 
CORRESPONDING  BASE  DATA  FREOUENCIES  ARRAYS.  THE  PALLING 
ARGUEMENTS  ARE: 

IP.  REAL  ARRAY  BASF:  CORRESPONDS  T0  THF  CUMHULATIVE 
SOUND  COMBINATION  FOR  WHICH  p P  IS  BEING  CALCULATED. 

?)  REAL  ARRAY  PAP  TPR08 :  THE  ARRAY  CONTAINING  THF 
"C OND I T I CNA  L"  PORTION  OP  THF  PR. 

3)  PEAL  APRAY  nUTnjT:  THC  ARC  A Y  TO  WHICH  THF  PP ' S  APC  to 
RE  ASSINGED.  OUTdijj  MUST  BE  DIMENSIONED  !DcNTICALLY  WITH 
BASE. 

4)  LC:  THE  NUMBEP  OF  ELEMENTS  IN  OARTPROB  LESS  ONE. 

5)  FNID:  THF  NUMB E°  OF  ELF MCNT S  im  RASP  DWIDFD  BY  rH  = 

NUMB FR  0 F  ELFMENTS  IN  PAPT  PRDB  L<=SS  ONE. 

*************** AAA A A* A A A* ******** A*** ******* A A  A A* ********** ; 

PROCEDURE  PROB  T WQ (REAL  ARRAY  B A S E ( * , *  )  :  R E A L  A  R°  A  Y  daptpphb 
(*,*);RFAL  ARR^Y  OUTPUT («,*); INTEGER  VALUE  LC  t  FN I D ) 5 
BEGIN 
REAL  B; 

INTFGFR  T,P; 

:  =- 1  : 

FOR  H  :=  0  UNTIL  LC  DO 
BFGIN 

FOR  I  : =  0  UNTIL  FNID  DO 
BEGIN 
9  :=  0; 

T  :=  T  + 1  : 

FOR  J  :=  1  UNTIL  NUMB c  R  DO 

B  :*  B  +  9ASE(T  ,  J  )*PARTPRn(?  (h  ,  j)  ; 

FOR  K  :=  1  UNTIL  NUM8FP  DO 
BEGIN 

IF  (8^=0)  THEN 

OUTPUT ( T  tK )  :=  B ASE ( T t K ) *  PA RTPROB ( H , K ) / 9 

EL  Sc 

OUTPUT ( T  ,0 )  : =- 1 
END; 

END  *. 

FND; 

END  PR09_TW0; 
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COM  Sk#  sjcsjc  sfcajcaG::?:  afeJfc*#  sfcajc  #*:£*#  **#*:$:  3{«:fc:js  £25: 

PROCEDURE  DECIDE  READS  TEST  SAMPLFS,  ANALYSES  THEM  AMD 
PRF  SENT  S  A  SET  OF  NUMBERS  TO  THE  ANALYSIST  FDR  A  DECISION. 
"SAMPLE  SIZE"  IS  THE  NUMBER  OF  WORDS  IN  THE  TEST  SAMPLE 
LFSS  ONF. 

"0"  COUNTS  THE  NUMBER  OF  WORDS  IN  A  TEST  SAMPLE  FOR 
PRINTING  A  I  DENT Ic I C  AT  I  ON . 

»N"  COUNTS  THE  NUMBER  OF  S  A  M  °L  E  S  (WHICH  IS  LIMITED  ONLY  BY 
TIME  AVAILABLE  OM  thf  COMPiJTFR)  . 

THE  "CASE"  STATEMENT  ALLOWS  THE  SELECTION  OF  0NC  PROGRAM 
SEGMENT  FROM  A  LIST  FOR  EXECUTION  DEPENDING  nM  THE 
ORDINAL  VALUE  OF  THE  "CASE"  STATEMENT  PARAMETER  " p . "  THF 
VALUF  OF  "P"  DE°ENDS  ON  "Z"  AMD  IS  ASSIGNED  3V  AN  "IF" 
STATEMENT. 

**BY  VARYING  THE  RANGE  OF  "o»  With  CHANGES  Tn  the  «Ic" 
STATEMENT  IN  "DECIDE"  AND  "RAW  ANALYSIS"  THE  PROGRAMMER 
CAN  SELECT  THE  PARTITION  OF  THF  DATA  TF ST  SETS  AND  SAMPLE 
TEST  SETS  ACCORDING  tq  WORD  LENGTH. 

"T»  TELL  LATER  P  AD  T  $  OF  "DECIDE"  HOW  MANY  ROWS  OF  thf 
ARRAY  "DECISION"  WFrp  USED  for  THAT  WORD... 
*********************************************************** 


PROCEDURE  DECIDE; 

BEGIN 
REAL  C,B; 

REAL  ARRAY  DE C ! S ION ( 0 :  : 9 , 1 : s NUMB E P ) : 

REAL  ARRAY  SA VEDEC I S I  ON ( 0 : : SAMPLES  I Z E t  I : : NUM3E R)  ; 
INTEGER  P»t,Q; 

INTUELDSIZE:  =  *  ; 

Q  s  =0; 

N  : =  m+  1 ; 

WRI TE( "SAMPLE  NUMB f R " t  n )  : 

FOP  K:  =0  UNTTL  SAMPLESIZE  DO 
BEGIN 


* 


Q  :=  0+ I ; 

FOUND: =GETWOPD( WORD)  ; 

CONVERT(CODE)  ; 

WR  T  TF  (  "  IOCONTROU  2)  ; 

WRITC( "SAMPLE  WORD  NUMBER" , C  » " I M  CODE  FORM  IS:", CODE): 
WRITE!"  IOCONTROU  2)  *, 

I F ( B AD  =  TRUE )  THEN 
BEGIN 


IF ( Z<=5 )  then  P:=Z+1  ELSE  P:=6; 
CASE  o  OF  BEGTN 


COMMENT****  WORD  L=NGTH  =  one*****; 


BEGIN 
T :  =  0  * 

FTNOSOUMD(O)  ; 

0NEP0INT:  =  P0I  NTEP: 

FOR  I  :=  1  UNTIL  NUMBER  DO 
DEC  ISI ON ( 0  » I ) :  =  0MC1 ( POINTER , I) 
END; 


COMMENT****  WORD  LENGTH  =  TWO*****; 
BEGIN 

F I  NO  SOUND ( 0 ) : 

T  :  =  1  ; 

ONE  POINT  :=POI  NT  ER  ; 

FOP  I : = 1  UNTIL  NUMBER  DO 

DEC  I  SI  ON (0, I) :=ONE 2 (POINTER , I ) ; 

F I  NO  SOUND (1  )  ; 

POI NTE R :  =  ONEPO I NT*44- POINTER  : 

FOR  I : =1  UNTIL  NUMBER  DO 


DEC  T  SIGN  < 1 ,1 ) :=ONFTWO?  <  POINTE®, I ) 
END; 


COMMFNT****  WOP  D  l  fnjjth  =  THREE**** 

BEGIN 
t  ;  =  ?  ; 

F I  NO SOUND (0 ) : 

ONE  POI NT:  =  POT NT E  P : 

FOR  I : = 1  UNTIL  NUMBER  DC 

DEC  ISIONIO,  !  )  :=0N=3! POINTER,  I  )  ; 

F I NDSOUND ( 1 ) ; 

POT  NtER:=0NFPCINt*4  4-P0INTcp  : 

TWOpnI  NT  :  =PQI  NTEP; 

POP  I : = 1  UNTIL  NUMBER  DO 

DEC ISI0N!1  ,1) : =0NFTWC3 ! POINTER , I )  : 

F I NDSOUND (2) : 

POI NTER :=TW0P0!NT*4fO0INTEP : 

FOR  1 1  = !  UNTIL  NUMBF®  ^0 

OEC IS  ION ( 2, I )  : =ONCT  W03  3 ! °0I NT c R  «  I  )  ; 

END  ; 


COMMENT****  WORD  LcNSTM  =  FOUR***** 

BEGIN 
T  :  =  3  ; 

F  I  NDSOUND!  0  )  *. 

ONE  PCI  NT :  =  POI NTEp; 

C0P  I : = 1  UNTIL  NUMBER  DO 

DEC  I  SI  ON ( 0 , I )  :=0NE4( POI NTEP , I) : 

F  I  NDSOUND  (  1  )  *, 

POT  NTcR: =ONEPOI NT*4  +  P0  I  NT EP : 

TWO  POI NT : =p0T NT  E 0 : 

FOP  I :  =  1  UNTIL  NUMSE3  DO 

DEC  ISIONd  ,  I  )  :  =0  NET  WO  4  (  POINTER,  I  )  : 

F I  NO  SOUND! 7  ) : 

POI Nter : =*W0P0INT*4fp3l NtfR ; 

ONE  POI NT :  =  PQI NTER; 

FOR  I :  =  1  UNTIL  NIJMBEP  DO 
DECISION! 2, I ) : =0N=tW034( POI ntfr t i ); 
F I NDSOUND! 3) ; 

POINTER:=CNEPCTNT*4fPOd'TEP  : 

FOR  I :  =  I  UNTIL  NUM9  fo  no 

DcC  I  SI  ON (3, I)  : =0NCTW03 44 ! POINTER ,  I) 

END; 


COMMENT****  WORD  LENGTH  =  FIVE***** 

BEGIN 
T  :  =  4  • 

F I NDSOUND ( 0 ) ; 

ONEPOINT :=POI NTEP; 

FOR  I  ;=  1  UNTIL  NUMBER  DO 
DEC  I  SI  ON (0 , 1 )  ;=0NE5 ( POINTER , I ) ; 
FIMOS^UND! 1 ) ; 

POINTER ;=CNFP0TNT*4fP0INTFP : 

TWHPOI  NT  ;  =p  01  NTER: 

FOR  I : = 1  UNTIL  NUMBER  DC 

DEC  ISION! I , I ) : =0NETW05 ! POINTER,  I) ; 

F I NDSOUND ( 7 ) : 

PQ ! NTE  R : =TWO PPI NT*4  +  D0 1  NT E R ; 
THPFFPOINT: =POI  NT=R; 

FOP  I ; = 1  UNTIL  NUMBER  DO 

DFC  I  SI  ON! 2, I ) ;=0NCTW03  5! POIMTcp, I ) ; 

F I NDSOUND ( 3 ) ; 

p0 INTER : =TH REF POI NT*4  +  POINTER: 
TWOPOi NT :=P0T  ntpp : 

CQR  I :  = !  UNTIL  NUMBER  DO 

DEC  I  SI  ON (3, I )  : =0NFTW03  45  ( POINTER, t) 


F INDSOUNDl 4 ) ; 

P0INTER:=TW0PQINT*4i-P0 INTER  ; 

POP  I:=l  U^TIL  NUMOPP  DO 

DEC  I  SI  ON (4, I )  : =0MCTW034F 5 ( POINTER tT ) 

END; 


COMMENT****  WORD  L  ENGT H  =  SIX  Z  GREATER***** ; 

BEGIN 
T  :  =  R  ; 

F I ND SOUND (0 ) ; 

ONE  °0I NT ;  =  POI NTER: 

FOR  I : = 1  UNTIL  NUMBER  00 

DEC  I S I  ON ( 0  ,  I ) :  =  0MC 6 (POINTER,  I); 

F I NDSOUND ( 1 ) ; 

PO !  NTER  :  =ONEPO I NT*4 <-PGINTER  ; 

TWOPOI MT:=POI  NTER: 

FOR  1 1  =  1  UNTIL  NUMBER  nn 

DEC  I  SION ( 1 , 1 ) : =QNETWQ5 ( POINTER, I )  ; 

F I MOSOUND ( ? ) ; 

PO I MTER:=TW0P0INT*4+ POINTER; 

THRFPPCINT :=POI NTPR; 

FOR  I  :=  1  UNTIL  NUMBER  DO 

DEC  I  SI  ON (2 ,1)  : =0NFTW036 ( POINTER , I ) : 

F I  NOSOUND ( 3 ) ; 

POINTER  :  =THREEP0IMT*4«-P0 INTER  ; 

«=0UPP0INT:=P0I  NTER  ; 

P0R  I :  =  1  UNTIL  NUMBER  DO 

DEC  I  SI  ON (3,1  )  :=0NETW03  46 ( POINTER, T ) ; 

F I  NO SOUND ( Z )  ; 

EPOI NT := POINTER; 

POINTER  :=  ONcPOINT*44-POINTER ; 

FOR  I :  =  1  UNTIL  NUM3EP  00 

DEC  I  ST  ON (4, I) :  =  F IR  STL  A  ST6( POINTER , II  ; 

POINTER : =FOUR POINT *4  APPOINT ; 

FOR  I : =1  UNTIL  NUMBER  DO 

DEC  I  SI  ON (8, 1 ) :  =  0NpTW334c  6( DOINTPR , I ) : 

SPACE: =Z-1 ; 

FINDSOUND(SPACE)  ; 

POINTER :=EPOINT*4fPOINTER; 
c dpi NT:= DOINTPR; 

FOR  I :  =  1  UNTIL  NUMBER  DO 

DECT  SI  ON (5, 1 ) :  =  Ep26( POINTPR , I ) : 

POI  NTCP  ;=P0URP0INT*1S<-EDCINT ; 

FOP  I : = 1  UNTIL  NUMBER  DO 

DEC  I  SI  ON  (6,1)  :  =  ALL  S0'JN0(  POINTER,  I  )  ; 

POI NTE  R :=TWOPOTNT* 1 S+E  POINT: 

FOR  I ; = 1  UNTIL  NUMBER  CO 

DEf  I  S I  ON  (7,1  )  :  =pl?EE26  (  doimt<=R,  I  ) 

pND; 

END; 


COMMENT**************************************  ****  *********** 
HERE  A  COLUMN  PRODUCT  IS  OBTAINED  FROM  THC  ARRAY 
"DEC  IS  I  ON"  AND  ASSIGNED  TO  THF  "K  TH "  ROW  OF  " S AVE DEC I  $  I  ON . " 
ONE  ROW  IN  "SAVEOFC  IS  ION"  IS  GENERATE  FOR  CACH  WORD  IN  the 
TEST  SAMPLE  SEt. 

**********************************************  *************  ; 

FOR  J: = 1  UNTIL  NUMBER  DO 
3FG  I N 

DECISION!*?,  J)  :  =  1  : 

POP  I : =0  UNTIL  T  DO 

DECISION!*?,  J)  :=0FCISI3N(9,J)*DECISI0N(  I  ,J)  ; 

END: 

WRITE! "VALIDITY  FACTORS  :"); 

WRITE! m  »);  IOCONTBOL! 2) ; 

FOR  J  :=  0  UNTIL  T  DO 
BEGIN 
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FOR  I  :=  1  UNTIL  NUMBER  00 
WRITE0N(0ECIS  TCN(  J  ,  I  )  )  : 

WMTE!"  ") 

END; 

FOR  J  :=  1  UNTIL  NU^RFD  pn 
SA VEDEC  I  S  I  ON  {  K,  J  )  t  =D«=C  I  SION  (9,  J) 
END  ; 

ENO  : 


COMMENT**************************************************** 
THE  NUMBERS  FOR  THF  DECISION  CPITERIA  ARE  A?tAInED  F&Om  thp 
ARRAY  "SAVEOEC  IS  ION1'  BY  TAKING  ROTH  A  COLUMN  P®OOUCT  AND 
COLUMN  APETHME^IC  MEAN. 

*  ^t*******#:*:**:**#**  ***:£*  ft*  ****«$  *«#****  #****$*  *££***#*  I****** 

IF(SAMPLESIZF=0)  THEN 
BEGIN 

FOR  J:  =1  UNTIL  NUMBER  00  WR  I  T  EON  (  S  A  V  FDEC I  S  ION  (  0  ,  J  I) 

END 

ELSE  BEGIN 

WR  I  TE(  "  " ) ;  IOC  ONTROL  {  ?)  ; 

WRITE( "VALIDITY  FACTOR  PRODUCTS:"); 

FOP  J: =0  UNTIL  Q-I  DO 
BEGIN 

WRITE!"  " ) : 

FOP  I :  =  1  UNTIL  NUMBER  DO 
WRITEON(SAVEDECISTON{J,  I ) ) 

END ; 

FOR  J: =1  UNTIL  NUMBER  DO 
BEGIN 
B :  =  1 ; 

C :  =  0 ; 

FOR  I  :=  0  UNTIL  SAMPLESIZR  DO 
BEGIN 

B  :=  B*SAVEDECIS ION! I , J ) : 

C: =C+S  A VEDEC I S ION! I, J) 

END; 

C:=C/(  SAMPLESIZE+1  )  ; 

WR I TE ( "  ");  IOCCNTROL!  2) ; 

WRITS! B) ; 

WRITEON(C)  ; 

WRITS!"  ") 

END; 

END; 

END  DECIDE; 
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COMMENT****  **  ^  aje  #  5{c 

PROCEDURE  COMPUTERDOM  PEPRE  SEMIS  A  CONVENI  ENT  METHOD  th 
GROUP  MULTIPLE  CALLS  TO  thc  SAME  PROCcD'JRE  WITHOUT 
CLUTTERING  UP  the  MAIM  PROGRAM. 

**4**t*ti***i4**4i*!l!****4!|ci**)(t!!:*i!*$*4*iW****$#**4i***!|<*!!!i*  ; 

PROCEDURE  COM  PUTC_MDRM ; 

BEGIN 

COMPUTE  ANY (  3  ,  ONEl  )  ; 

COMPUTE_ANY( 3 ,ONE2 ) ; 

CqmoutE  ANY(R,ONE3): 
compute  Zany (3  ,  0NE4 ) ; 

COMPUTE  ANY (  3 , 0 Nc5 ) ; 

COMPUTElANY ( 3  ,ONF6)  ; 

COMPUTE  ANY ( 3  ,  p6 ) ' 

COMPUTE- ANY ( I  5  ,ONETWD2 ) ; 

COM  °UTEZ ANY ( 1 5 , 0NCTW03 ) ; 

COMPUTF_ANY( 1 5, 0NETWG4 ) ; 

COM°UTE_ANY( 1 5, OMPtwo? ) ; 

COMPUTE  ANY (15, ONE Oft ) ? 

com  put e Zany ( i  5, fir stl a  sts ) ; 

COMPUTE  ANY  (  1  5t  PIRSTLA  ET6  )  : 

COMPUTE_ANY( IE, EE?6 ) ; 

COMPUTE_ANY(63,ONETrfD33  )  ; 

CnMPUTE_ANY(63,0NETW0’4) ; 

COMPUTE  ANY! 63,0NCTW035 ) ; 

C0MPUTE_ANY(63,0NFTW036 ) ; 

C0MOUTE_ANY(  0  55,0NETW0R44)  ; 

COMPUTE_ ANY (255,ONPTWD345 ); 

COM  PUT  P  ANY ( ?  55 , ONE TWO 3 46 ) ; 

COMPUTEZANYI 259,E12EE26) ; 

COMPUTE. ANY ( 1 023 ,0NETW034F5 ) ; 

COMPUTE.ANYl I023,DNETW034F6 ) ; 

COMPUTE  ANYI4095, ALLSOUND) 

END  CPMPUTE.NOPM; 
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COMM  ENT***  *’5'***  ************  **************************** 
PROCEDURE  RAW  ANALYSIS  USES  AN  IDENTICAL  "CASF"  STAT=MF 
TO  THAT  USED  IN  "DECIDE'1  TO  PARTI T 1 0  M  the  DATA  BASE  S  =  t 

#** ***#£:(:**  A**#******:*:*  ******************************** 

PROCEDURE  RAW_ANALYS IS ; 

BEGIN 
INTEGER  P; 

I F ( B  AD  =  T  RUE )  THEN 
BEGIN 

PQI NTFR :=0; 

IF ( Z<  =  5  )  THEN  R  t  =  Z  + 1  ELSE  P:=6; 

CASE  P  OF  BEGIN 


COMMENT  ****** WOP  DS  WITH  ONE  SOUND******; 
ONE  SOUND ( ONE  1  ,0,4)  ; 


COMMENT******WORDS  WITH  TW0  SOUNDS******; 
BEGIN 

ONFSOUNDIONE? ,0, 4) ; 

ONF SOUND! ONET W02 ,1 ,16) 

END: 


COMMENT******WORDS  WITH  THREP  SOUNDS******; 
BEGIN 

ONE  SOUND (0NE3, 0,4)  ; 

ONE  SOUND ( ONET W0  3 ,1,16)  ; 

ONF  SOUND ( ONET Wn 33, 2 , 64 ) 

END; 


COMMENT ****** WORDS  WITH  FOUR  SOUNDS******: 
BEGIN 

ONF  SOUND! 0NE4 , 0, 4) ; 

ONE  SOUND (0NETW04,1 , 16)  : 

CNF  SOUND (ONET WO  34, 2, 64) ; 

ONE  SOUND! ONET WO  344, 3,266) 

END  ; 


COM ME NT ****** WORDS  WITH  FTVE  SOUNDS******: 


BEGIN 

ONESOUND(CME5 ,0,4) : 

ONF  PCI NT:=PQ! NTER: 

ONE  SOUND (ONET WO 5, 1 ,16)  : 

ONE  SOUND (ONET WO  3 5, 2, 64 ) ; 

ONE  SOUND (ONE TWO  345 ,3,256) ; 
0NC  SOUND ( ON FT WP34c5»4t  1024) ; 
POTNTER:=CNEPOTMT: 
0NES0UND(FIRSTLAST5,4, 16) 
END; 


C0MMFNT******WORDS  WITH  SIX  £  G°=ATcc  SOUNDS******: 
BEGIN 

0NCS0UNC(0NE6 ,0,4) ; 

ONF  PQI  NT :  =  POI  NTER; 

ONF  SOUND ( ONET W06 , 1 ,16) ; 

TWOPOI NTssPOI NTFR; 

ONESQUMD( 0NETW036, 2 , 64 ) ; 

0NES0UNC(0NFTW0346 ,3,256) ; 

FOURPOI Nt:=POINT=p ; 

POINTER  :=0; 
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0NFS0UND(E6, Z ,4 ) ; 

SPACE: =0NEP0INT*4+P 01  NTER  ; 

FI  PSTL  AST6(  S P ACE,  NUMB CR  )  :  =F  I  RSTLAST6  (  S  P4CP  ,  NUMBER  )  f  1 
FI  PSTL  AST6(  1  ?>  tNUMBER)  :  =  FIRSTLAST6  (  16  , NUMBER  )  M  ; 
cIPSTLAST6(SPACE,0 ):=l: 

SPACE : =  FOUP  POI NT*4+°0I  MTER  : 

ONFT  W034E6  (  SP  ACE,  NUMBER  )  :  =0NETWQ34E6  (  SPACC,M'JMBFR  )  + 1 
ONETWP34E6(SDACE,0) :=1 : 

0NETW034E6 ( 1024, NUMBER ) : =0NFTW034E6( 1024,NUMBCR) fl ; 
SPACE:  =?.-!; 

0NFS0UN0(FF26 , SPACE,  16 )  ; 

SP ACF :  =  FOU P PO IN T*1  6  + POINTER  : 

ALL  SOUND (SPACE, NUMBER ) : = ALL  SOUND ( S PAC 5 , NUMBER »  +1 : 

ALL  S  QUND (SPACE, 9):=1; 

ALL  SOUND (4096, NUMBER) : =  ALLS OUND (  4096 , NUMBER ) v 1 ; 
SPACE:  =  TWOP  01  NT*  16  4-POINTER  ; 

F12EF26 (SPACE ,NUMRFR 1  :  =  F 1 2E E2  6 ( S PACE , NUM0F R ) f 1 ; 
F12EE26(SDACF,0):=1; 

F12EE2  6  (256,  NUMBER  )  :  =  c  1  2  EE2  6  (  2  56  ,  NUMB  EP  )  +1 
END; 


END; 

END; 

END  P A  W_ANA  LYSIS; 
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C  HMM  cn  **  **  ***  *  **  * 

DRPC  EDUP  E  COw?'JTP_AMY 
containing  FREQUENCIES 
DAT  A  ARRAY  CONTAINS  TH 
FREQUENCY  DATA  STORED 
TOTAL  IS  DIVIDED  INTO 
A  PERCENTAGE  0»=  OCCJRR 
WITHIN  the  GIVEN  PARTI 
THE  CALLING  A  RGUEMPNTS 

1) .  INTEGER  Iteration 
THE  ARRAY  WHICH  HAS  FR 
GENERAL  IT  ROW  OIMENSI 

2)  RAW  NUMBERS:  the 

*******5**********  **** 


*  **  ** 

IS  EXECUTED 
°ER  LANGUA 
E  TOTAL  NIJM 
IN  THP  CORR 
EACH  FNTPY 
A MCE  pPR  th 
T  T ON. 

APE : 

:  THE  SUBS 

E  OUENCv  INF 
ON  OF  THE  A 
ARRAY  BEING 
*********** 


*************************** 
ONCE  FOR  EACH  ARRAY 
GF.  THE  LART  ROW  OF  EACH 
BER  OF  WORDS  WHICH  HAVE 
ES PENDING  COLUMN.  this 
IN  the  COLUMN  GIVING 
AT  SOUND  COMBINATION 


CRT PT  OF  TH=  LAST  ROW  IN 
ORMATION  IN  IT.  IN 
PPAV  BEtMG  PASSED  TM  LESS  1 
NORMALIZED. 

*************  *************; 


PROCEDURE  COMPUTE  ANY( INTEGER  VALUE  I TERAT ION ; RFAL  ARRAY 
RAW  NUMBERS ( * ,*) ) T 
BEGIN 
INTEGER  T; 

T  :=  I  TER AT  I C  N+ 1  : 

I E  (  RAW_NUMBERS(T,  NUMBER  )-.=D  )  THEN 
BEGIN 

FOR  K  :=  0  UNTIL  ITERATION  DO 

RAW  NUMBERS ( K .NUMPEP ) : =R AW_NUM3ERS ( K .NUMBER) /RAW  NUMBER S 
(T, NUMBER) 

END; 

END  COM  PUT F  ANY; 
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COMMENT^*3**************  ****************  ********************* 

SINCE  ALGOL-W  HAS  M3  <=OUIVALcNT  Tp  a  FORTRAN  DATA  STATEMENT, 
ARRAYS  MUST  R F  INITIALIZED  TO  THE  DESIRED  VALUE.  THIS  IS 
DONE  CNF  COLUMN  A T  A  TIME.  "ZIP"  IS  thf  VALUE  GIVEN  ^0  EACH 
ARRAY  ELEMENT.  CnL'JMN  ZERO  IN  FACH  ARRAY  IS  SET  tq  MINUS 
ONE.  THIS  VALUE  IS  CHANGED  TO  PLUS  ONF  WHEN  AN  =NTRY  1= 
FREQUENCY  INFORMATION  IS  MADE  IN  THAT  PARTICULAT  ROW.  3Y 
CHECKING  COLUMN  ZCR3  IN  LATpr  PARTS  OF  THE  PROGRAM, 

DIVISION  Bv  ZERO  IS  ELIMINATED. 

£  sjcrfe#  ^C3tc  't  ; 

PROCEDURE  CLEARMFM(REAL  VALUE  ZI°); 

BEGIN 

INTEGER  J.K,L,R,N; 

J:=K:=L:=R:=N:=0; 

FOP  I  :=  0  UNTIL  *  30 
BEGIN 

0NC 1 ( I , NUMBER ) :=ZTP; 

ONF R I  I, NUMBER ) :=ZIP; 

ON  E  3 ( I , NUMBER ) :  =  ZI P; 

ONF 4 ( I t  KUMRCR ) :  =  ZI P ; 

ONF  5 ( I , NUMBER) :=ZT°; 

ONES ( I , NUMBER ) :=ZT°: 

E6t I  , NUMBER) : =  7IP; 

FOR  o  ;=  o  UNTIL  3  30 
BEGIN 

0NETW02 (J, NUMBER) :=ZIR; 

0NFTW03I J, NUMBER) :=ZI P; 

ONFTW04I J, NUMBER ): =ZIP: 

0NETW05 ( J,NUMPER) :=Z IP ; 

0NFTW06 ( J, NUMBER) :=ZIP; 

FIPSTLASTEI J, NUMBER ) :=ZID; 

FI°STLAST6( J.MUMBFP ) :=ZIP; 

EE26( J,NUMBER):=ZTP; 

J:  =  J  +  l ; 

FOR  S  :=  0  Until  3  30 
BEGIN 

0NETW033(K, NUMBER ) : =Z I  P ; 

ONE TWO 34 (K, NUMBER) :=ZIB; 

0NEtw035(k, NUMBER) :=ZI P; 

ONET WO 36(K, NUMBER) :  =  Z I P : 

FOR  T  :=  0  UNTIL  3  30 
BEGIN 

ONE TWO 344( L, NUMBER ) :  =  Z I P ; 

ONET W3 345 (L , NUMBER ):=ZIP; 

ON FT  WO 346 ( L , NUMBER ) :  =  Z I P : 

F 12EE2 6 ( L, NUMBER) : =ZI D; 

L: =L+1 ; 

FOP  M  :=  0  UNTIL  °  30 
BFGIN 

CNFTW034E6(R, NUMBER ) :=ZTP; 

0NETW034E5(R, NUMBER) :=ZIP; 

R I  =  R* l  * 

FOP  0  :=  0  UNTIL  3  30 
BEGIN 

ALLSOUND  (N, NUMBER) :=ZIP; 

N:  =  N  + 1 
END: 

END ; 

END; 

EN3 : 

END; 

END  ; 

END  CLEAPMEM; 
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COMMENT******  ***£:!*:  ***£:£ 

PROCEDURE  FINDSOUND  IS  CALLED  FROM  '’DECIDE"  AN 
VALUE  OF  "POINTER"  CORRESPONDING  TO  THE  SUBSCR 
SOUND  IN  THE  ARRAY  "CLASS."  PECAUSE  ALL  THE  S 
COMBINATIONS  ARF  POWERS  OF  FOUR.  IT  IS  A  SIMPL 
COMPUTE  SAY  A  TWO- SC'JND  COMBINATION  BY  RFMFM8E 
FIRST  SOUND  WAS.  M'JLT  I  PLY  I NG  TH  =  FIRST  SOUND  S 
AND  ADDING  THE  ScCOND  SOUND  SUBSCRIPT.  THC  BA 
MODULO  4  APITHNFTIC.  NO  MATTER  WHAT  SOUND  COM 
THE  OBJECTIVE  OF  THE  SEARCH.  TH  =  PROGRAM  ONLY 
LIST  OF  FOUR  CLEMFNTS.  "WS"  TELL  "cINDSOUND" 
IN  THE  WORD  tq  LOOK  AT. 

********************  *  **  ****  **  **  i**#*4A).^***^ia 


*********  *  *  *  ** 
D  RETURNS  A 
IPT  Oc  thf 
OUNn 

F  MATTER  TO 
RING  WHAT  thf 
UBSCRTPT  BY  4 
SIC  IPEA  OF 
BINATION  IS 
SEARCHES  A 
WHICH  SOUND 

***** ******** ; 


PROCEDURE  FINDSOUNDI INTEGER  VALUE  WS ) ; 
BEGIN 

PQI NTFR :=0; 

WHILE!  CODE  (WS  I  1  )  -« =  C  L  A  S  S  (  POINTER  )  )  DO 
POI NTER :  =  POI NT  FR  +  1 ; 

END  FI ND SOUND ; 


qq^MEN  t****  ************  ************************************* 
PROCEDURF  ONESOUNO  IS  C  A  L  L c  D  FROM  RAW  ANALYSES.  TT 
OPERATES  ON  THF  SAME  PRINCIPLE  AS  "FlFiDSOUND"  EXCEPT  THAT  TT 
AL  f  0  INCREMENTS  The  FREQUENCY  COUNT  \  M  thf  A5’  Rnp  P.  i  at  =  ARRAY 
"OUTPUT"  AND  INCREMENTS  THE  TOTAL  WORD  COUNTER  WITHIM 
"OUTPUT."  THF  RAW  OF  "OUTPUT"  WHICH  CONTAINS  THC  WORD 
COUNTER  MUST  PE  SPECIFIED  3*  THE  AR GUENFNT  "L E NGT H . " 
**********************************************  *************  ; 

PROCEDURE  ONESOUNO  (REAL  A<?RAY  OUTPUT  (*,  *) ;  I  UTEGpp  VALUE 
WS. LENGTH) ; 

BEGIN 

INTEGER  PIN; 

PIN  :=  POINTER; POINTER :=o; 

WHILE! CODF(WS I  1 )-=CLASS( POI NTFR) )DO 
POINTER ;=P0INTER+1 ; 

POINTER :=P IN* 4  + POT NTER  ? 

OUT  PUT ( POI NTFR, NUMBER)  ;  =  OUT  PUT ( PO I  NT FR , NUMBER ) + 1 ; 

OUTPUT (LENGTH, NUMBER ) : =OUTPUT( LENGTH , NUM8E R ) «■ I ; 

OUTDUT( POINTER ,0) :=l 
END  ONESCUND; 
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COMMENT********************  ********************************* 
PROCEDURE  SINGLE  PRD3S  COMPUTES  the  ACTUAL  °RO  BAB  IL  Tty  OF  A 
LANGUAGE  GIVcN  AFJy  SINGLE  SOUND  WHICH  the  PROGRAM  LOOKS  AT 
AS  A  SEPARATE  ENTT  Ty .  I.F.t  THE  FIRST  SOUND  OF  EACH  WPPD 
AND  THE  LAST  SOUND  OF  WORDS  SIX  SOUND  AND  GPC  ATE0  IN  LENGTH. 
THE  ASSUMPTION  IS  MADE  THAT  ALL  LANGUAGES  AR=  EQUALLY 
PR.OBABLF. 

*********************************************************** ; 

PROCEDURE  SINGLE  PPDBS? 

BEGIN 

RPAL  5,C »D,F, FtGtH; 

FOP  I  :=  0  UNTIL  3  DO 
BEGIN 

B:=c;=D:=E:=F :=G:=H:=D: 

FOP  J  :=  1  UNTIL  NUMBER  DO 
BEGIN 

B  :=  B  +  ONFK  I  t  • 

C  :=  C  +  ON  E  2  ( I .  J  ) : 

D  : =  D+  CNE3 (  I, J)t 
E  :  =  E  +  ONE 4 (  I, J): 
f  :=  F4-  OME5(  I ,  J  ) : 

G  : =  G  +  0NR6 ( I , J  )  ; 

H: =H+E 6 (  It J ) 

END? 

FOP  K  :=  1  UNTIL  NUMB f R  no 
8EGIN 

I  F  (  B-t=0  )  THEN 

ON  c 1 ( I t  K ) : =ON  F 1 (  1 1  K  )  / 3  ; 

IE(C-t=0)  THEN 

0NE2(I  tK) :=ONE2(  I,K  )/C 

I F  (  D-t=  0  )  THFM 

0NE3(I ,K):=ONE3(ItK)/D  ; 

I  F  (  F-,=  0  )  THEN 

ONE  4 ( I  t  K ) : =0NE4 (  I »  K ) / -  ; 

IF  (  F-,=  o  )  THEN 

ONE  5 (I tK):=ONE5(TtK)/p  ; 

I F  (  G-*=  0  )  THEM 

0N<=6(I  ,K)  :=0NF6(  I,K)/G  ; 

I  F  (  H-<=0  )  THFN 

E6(  I  ,K  )  :  =E6  (  I  ,K  )  /H; 

END ; 

end; 

END  SINGLE  RROBS; 
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COMMENT****** 

PROCEDURE  GRAPH  PROVIDES  A  CONVENIENT  GRAPHICAL 
REPRESENTATION  OF  ANY  DATA  ARP  AV  USED  AT  ANY  POINT  IN  th£ 
PROGRAM.  THE  CALLING  ELEMENTS  ARE: 

1)  DATA:  THE  ARRAY  TO  BE  GRAPHED. 

2)  LABEL:  THE  STRING  ARRAY  WITH  LABELS  CORRESPONDING  TO 
DATA. 

3)  COUNT:  THE  LAST  ROW  SUBSCRIPT  OF  "DATA"  WHICH  CONTAINS 
INFORMATION  TO  BE  GRAPHED,  I,F.t  ROW  DIMENSION  OF  DATA  LESS 
ONF. 

4)  SCALE:  NUMBER  OF  PERCENTAGE  POINTS  PFR  PRINT  SPACE, 
EITHER  1  OR  2. 

5)  HEADING:  10  T p  GRAPH  IS  TO  SHOW  RANGE  OF  ZFRq  TO  ONE 

HUNDRED.  5  IF  RANGE  IS  ZERO  T0  FIFTY. 

6)  WIDTH:  WIDTH  Oc  GRAPH  POINT  SPACE,  50  OR  100. 

7)  TOGGLE:  COLUMN  to  CHECK  TO  S  FF  I  <=  THAT  row  CONTAINS 
ANY  NUMBER  0T HER  THAN  ZERO.  IS  SET  TO  ZERO  cr>R  ALL  CASES. 

GRAPH  WILL  PRINT  A  LETTER  CORR FSPONO ING  TO  THE  ORDER  OF  THE 
BASP  DATA  SETS.  A  =  LANGUAGE  1,  B  =  LANGUAGE  2,  ETC. 

IF  TWO  VALUES  ARE  IDENTICAL  ON  A  LINE  OF  TH E  GRAPH  IS 

PRINTED.  IF  A  WIDTH  OF  50  AND  SCALC  OF  "  IS  SELECTED, 
NORMALIZED  SETS  OF  data  CAN  BE  REPRESENTED  BECAUSE  tH= 

VALUF  WHICH  IS  GREATER  THAN  50  WILL  BE  PRINTED  IN  TH<=  RIGHT 
MARGIN  OF  THE  GRAPH. 

$A  A  *AA$***#*$  £*<'**££  $***'’!'£=!':!' ********************  *£#*#******; 

PROCEDURE  GRAPW  (REAL  ARRAY  DATA(*,*); 

ST  R I NG ( 6 )  ARRAY  LA  B  FL  (  *  I  ; 

INTEGER  VALUE  COUNT , SC  ALE, HEADING  , WIDTH, TOGGLE); 

BEGIN 

STRING (1)  OVERFLOW; 

INTEGER  AMT, BIG, LINES; 

LOGICAL  Ton  BIG; 

string (17)  Flag; 

STRING(l)  ARRAY  MATRIX  (0::WIDTH); 

procedure  horiz.linf; 
begin 

IOC ONTPPL ( 2 ) ;  WRITEC'  "); 

FOP  I  : =0  UNTIL  WIDTH  DO 
BEGIN 

IF (  I  REM  5  =  0)  tHFm 

WRITpon("+")  ELSE  WR I tcON ( "-" )  END: 

END  HOP  I Z_L I NE ; 

PROCEDURE  hopiz.head; 

BEGIN 
INTEGER  p; 

WRITEC'  "); 

IOCONTPOL(? ) ;  WRI Tc ( "  "); 

I F ( W IDTH=50 )  THEN  INTF I  ELDS  I ZE : =3  PLSC  I NTF I E L DS I ZE :  =  3 ; 

FOR  I  :=  1  UNTIL  10  DO 
BPG I  N 

P  ;=  SHEADING  ;  WRITFON(P) 

END; 

END  HORIZ_HEAD? 

CL AG := "  ABCDEFGHI JKLMNPR "; 

HORIZ  HEAD: 

HOP  I Z~L I NE ; 

LI NF  S :  =  0 ; 

FOR  I  :  =  0  UNTIL  COUNT  DO 
BEGIN 

TOO  9IG:=FALSF; 

OVERFLOW:  =" 

TF(QATA(  I,TOGGLE)-»  =  -l)  THEN 
BEGIN 

LINES:=LINFS+1  ; 

FOR  J  :=  0  UNTIL  WIDTH  00 
BEGIN 

MATRIX(J)  :=  " 
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FOR  K  :=  1  UNTIL  NUMBER  DO 
BEGIN 

AMT  :  =R  O'JND  (  (  DAT  A  (  I  ,  K  )  *  1  00  )  /  SC  ALC  )  : 

IF ( J=WIDTH) AND< AMT>J )  THEN  BEGIN 

TOO  BIG:=TRUE; 

0VE*cL0W(0| 1) :=  flA3(K|1); 

BIG:  =AMT 
END: 

I  F ( A  MT  =  J  )  THEN 
BEGIN 

IF  (  MATRIX!  J  )^=”  ")  THEM 
MATRIX(J)  := 

ELSE  MATRix(J)  :=  FLAG(Kll) 

END; 

END; 

END: 

I OC  ONT  RQL ( 2  )  : 

WRITE! LABCL( I) » ”  |  "  )  : 

FOP  H  :=  0  UNTIL  WIDTH  00 
BEGIN 

WP  I  TEON ( MATR I  X ( H ) ) 

END  : 

writfomcm  ") ; 

INTcTFLOSIZE:=T: 

I F ( T  00  BIG* TRUE)  THCN  WR IT FOM( OV E RFL OW , B I G )  ; 

I c ( C0URT<1 6 )  THEN 

BFGIN 

WRITE!"  I"); 

FOR  T  :=  o  UNTIL  WIDTH  DO  WRITEON!"  ");  WP  I T  P  ON (  ”  | " > 
END; 

END; 

IF! L IN  ES=5C )  THFN 

BEGIN 

LI NCS: =0 ; 

HOP  I Z  LINE; 

HOP! Z  HEAD: 

IOCUNTrcL ( 3 ) ;  WRITE!” COM TINUCD"): 

WRITE!”  "I; 

HORI Z.hrad; 

HOP  I Z_L I NE 
END; 

END; 

I0CQNTO0L! 2) ; 

HORTZ  LINE; 

HORI Z  HEAD  ; 

W°  I T  ET”  ”5;  WRIT^i "sample  SIZES  ARE  :”); 

AMT  :=  COUNT* l: 
inC  0NTROL<  2)  : 

FOR  I  1  UNTIL  NUMBER  DO  WR! TEON! D A ta ( A*T , T ) ) 

END  GRAPH; 
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MA I N~ PROGRAM :  GREATER  FLEXIBILITY  WAS  R  E AL I Z  c  D  BY 

ENCLOSING  THE  ENTTRF  PROGRAM  IN  THE  "DUMMY"  PROCEDURE 
THIS  ALLOWS  THE  PROGRAMMER  T0  $DCCIFYt  WITH  THE  VALUE  Cc 
"DIM"  HOW  MANY  LANGUAGES  ARE  TO  BE  LOAOED  AS  BASE  DATA. 

CL  A  SS ( 0  )  :  =  "  ": 

CLASS! 0) (Oil) :  =  "  F  "  ; 

CLASS! 1):="  "; 

CLASS! 1) (01 n :="N"; 

CL  A  S  S(  2  )  :  ="  "; 

CLASS! 2) (01 1) :="S"; 

CL  ASS ( 3 ) : =" 

CL  A  SS ( 3 ) (0| 1)  :="V«; 

GENERATETWO; 

GENERATPTHPEF; 

GENPRATFFOUP : 

GENERATC  SIX  *. 

GETARR ay; 

SETCOUNT; 

NUMBER  :=  0; 

CLEARMEM!-! ) : 

N  :=  1; 


WH I L  E ( N>0)  AND! NUMBEP<DI M)  DO 
BEGIN 

NUMBER  :=  NUMBER  +  1  ; 

INI TI ALI ZF; 

CLEARMEM ( 0) ; 


WHILE! GETWORD(WORD) )  DO 
BEGIN 

CONVERT (CODE)  : 

RAW  ANALYSIS 
END; 

C  OM  °U'rE_NORM ; 

READ(N) 

END; 
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GRAPH ( 0NE1  ,r LASS 
GRAPH(ONE2, CLASS,? 
GRA  PH( 0NE3 , CL  ASS , 3 
GRAOH(0NE4, CLASS  ,3 
IOCnNTROLt?) ; 

GRA  PH ( ONE5  ,CLAS? 
GRA  PH ( ONE6  , CLASS 
GRAPH(E6  , CLASS, 


,3,2,10,50,0) ; 
,1,  5,50,0); 
,1,  5,50,0); 

,1,  5,50,0); 

,3,1,  5,50,0); 
,3,1,  5,50,0); 
3,1,  5,50,0); 


GRAPH! ONETWO? 

,  S  O'JNOTWO , 

15, 

1  , 

5, 

5", 

0 

) ; 

GRA  DH( 0NETW03 

I OC  ONTROL ( 3 ) ; 

, SOUNOTWO , 

15, 

1  , 

5  , 

50, 

0 

); 

GP  A°H( ONETW04 

, SOUNOTWO , 

15, 

l  , 

5, 

-o, 

0 

) ; 

GRA  PH ( 0NETW05 

IOC ONTROL (3 )  ; 

, SOUNOTWO, 

15, 

l  , 

5, 

50, 

0 

); 

GRAPH! 0NETW06 

, SOUNOTWO, 

15, 

1  , 

5, 

50, 

0 

) ; 

GRAPHtFIR  STLAST5 

I OCGNTROL ( 3 )  ; 

,  SDUNDTVJO  , 

15, 

1, 

5, 

50, 

0 

) ; 

GRAPH! FIPSTLAST6 
GR  A  DH  (  E  E26 

, SOUNOTWO, 
, SOUNOTWO, 

15, 

15,1 

1  , 

,  5, 

5, 

50, 

r 

o 

trw 

o 

0 

) ; 

I  OCONTROL  (3); 


GRAPH  (ONFTWP33 

, SOUNDTHREE 

,  63, 

1  , 

5, 

50, 

0) 

GRAPH  ( ON ET WO 34 

,soundthref 

,  63, 

1  , 

5, 

50, 

0) 

I Or  ONTROL ( 3 )  ; 

GRAPH  ( ONETWO 3 5 

, SOUNDTHREE 

,  63, 

1  , 

5, 

■^o. 

0  ) 

GRAPH  (0NFTW036 

, SOUNDTHR  Ec 

,  67 , 

1  , 

5, 

50, 

0) 

IOC ONTROL!?) ; 

GRAPH  ( ONETWO  344 

, S  0  UNDFOUP  , 

255, 

1  , 

5, 

o 

in 

0) 

I OC  ONT  ROL ( 3 ) ; 

GRAPH  ( CNFTW0345 

, SOUNDFOUR  , 

255  , 

1  , 

5, 

50, 

0) 

IOCONTPOL ( 3 ) ; 

GRAPH  (0NFTW0346 

, S3UNDP0UR  , 

255, 

1  , 

5, 

50, 

0) 

IOCONTROL! 3) ; 

GRAPH  ( F12FE26 

, SOUNDFOUR  , 

255  , 

1, 

5  , 

50, 

0  ) 

I OCONTROL ( ? ) ; 

GRAPH  (CNFTW034E5, 
I OCONTROL (3); 

GRAPH  (0NFTWP34F6, 

I oc  ontrol ( ? ) ; 


1023  ,1 

,  5 

,50 

,  0 

) 

1023  ,1 

,  5 

o 

IT, 

► 

,0 

) 

1  UL  l.  1  1  ,MJ  L  \  ?  1  . 

GRAPH  (ALLSCUNO  ,SOUNDSTX  ,4095  ,1  ,  5  ,50  ,0  ); 
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S I NGlc  PROPS: 

PROP  TR0(0N?TW02,  ONP?,  0NFTW02,  3,  3); 

PROB- TW0(0NFTW03  ,  ONF3 ,  0NFTW03.  3,  3): 
pROB"tWO(ONETVIC33,ONETW03,ONETWD3  3 ,1  5,3  )  ; 
PRO8“TW0(0NETW04,  0NC4,  ON  ETW04 ,  3,  3): 
PR0B“TW0(0NPTW034,  0NETW04,  0NFTW03*-,  15t  3); 

pROB~T  WO ( ONE TWO 344 ,  ONJETW034,  ONcTW0344,  63,  3); 

PR 03  TWO ( ONFT W05 ,  0NC5  ,  ONE TWO 5 ,  3,  3>t 
PROS- TWO ( ON ET  WO 3  5 ,  0MFTW05,  0NFTW035  ,  15,  3); 

PROB  TWO  (ONE"' WO  3  45  ,  0NFTW035,  0NETW0345  ,  63,  3); 
°ROB— TWO ( ONFT  W034E  5 ,  0NFTW0345,  0NETWO34E5,  355,  3); 

PR08  TWO (F1RSTLAST5,  0NF5,  cIRSTLiST5,  3,  3); 

PROB- TWO ( OMpT  W06 ,  CNR6  ,  0NETW06,  3,  3); 
PR06~TWQ(CMETw036,  0M=TW06,  0NFTWO36,  15,  3); 
PROB^TWO  ( CNETTW0T46 ,  0M5TWO36,  0Nci"W0346,  63,  3): 
PROB- TWO ( ONE  rW034h6 ,  HNETW0346,  0NETW034F6,  255,  3); 

PR0B_TW0(FF26 ,-^S7  EE26,  3,  3); 

PROB  TW0(F12EE26,nNETW06,Fl2FE26,  15,15); 

PROB  TWO ( ALLSOUND, ONE TW 03 4 6 , a LL SOUND ,255,15); 

PROB  tW0(FIR$TLASt6,  0NF6,  FIPSTLAST6,  3,  3); 
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GR  A  PH ( ONE  1  ,CLASS,3,1,  5,50,0); 

GRA  PH<  0NE2, CL  ASS , 3,1,  5,50,0): 

GRAPH(0NE3, CLASS, 3,1,  5,50,0); 
GRAPH(0N<=4, CLASS, 3,1  ,  5,50,0): 

I OCONTROL ( 3 ) ; 

GRAPH (0NE5  ,CLA' 

GPAPH(0ME6  ,  C  L  A 
GRA0H(F6  ,CLAS! 

I OCONTROL ( 3 ) ; 

GRA  PH( 0NETW02 
GRA  PH ( 0NFTW03 
I  OC  ONTROL ( 3 )  ; 

GRAPH! ON  ETW04 
GRAPH! 0NETW05 
I  OCONTROL! 3)  ; 

GRAPH! 0NETW06 
GRAPH! FIRSTLAST5 
IPCONTROL!?)  *, 

GRAPH! FIRSTLA  ST6 
GRAPH! EE26 


3,1,  5,50 

,0)  : 

3,1,  5,50 

,0) ; 

,1,  5,50, 

0)  ; 

SOUNDTWO, 

15, 

1, 

5, 

=;n 

SOUNDTWO, 

15, 

1 , 

5, 

50 

SOUNDTWO, 

15, 

1, 

5, 

50 

SOUNDTWO , 

15, 

1  , 

5, 

50 

SOUNDTWO  , 

15, 

It 

5, 

50 

SOUNDTWO, 

15, 

1  , 

5, 

50 

SOUNDTWO, 

15, 

1  , 

5, 

50 

SOUNDTWO, 

15,1 

,  5, 

50, 

0): 

, SOUNOTH0  FF  ,  63,  1 

, SOUNDTHRFF  ,  63,  1 


,soundthoef 

, SOUMDTHREc 
, SOUNOFOUP  • 
, SOUNOFOUR 
, SOUNOFOUP  • 
,SOUNDFQUR 


63,  1 
63,  1 


I OC  ONT  ROL ( 3 )  ; 

GRAPH  ( ON FT Wn 3 3 
GRAPH  (0NETW034 
IOCONTPOL! 3) : 

GRAPH  ( ONcT W03  5 
GRAPH  (0NFTW036 
I OC  ONTR OL ( 3 )  : 

GRAPH  (0NFTWC344 
IOrONTROL ! 3) ; 

GRAPH  (0NFTW0345 
I0C0NTR0H3)  ; 

GRAPH  (0NETW0346 
I OC  ONT  ROL ( 3 )  ; 

GPAPH  (F12FE26 
I OC  ONT  ROL ( 3 ) : 

GRAPH  ( GNFTWP34E5 , S3UMDF IVE  ,1033  ,1 
I OC  ONT  ROL  !  ?  )  : 

GRAPH  (0NETW034F6, S3UN0FTVF  ,1023  ,1 
IOC  ONT ROL ( 3 ) ; 

GRAPH  (ALLSOUND  ,SOUNOSIX  ,^095  ,1  , 

IOC  ONT ROL (3 ) ; 


?  55  , 
255, 
255, 
255  , 


0  ) 
0  ) 

0  1 
0  ) 

0  ) 
0  ) 


5,  50,  0); 
5,  50,  0): 

5,  50,  0); 
5,  50,  0); 

5,  50,  0); 

5,  50,  0); 

5,  50,  0); 

5,  50,  0); 

,50  ,0  ) ; 

,50  ,0  )  ; 


5  ,50  ,0  ): 
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READ(SAMFLESIZF)  ; 

N  :  =  0  ; 

WH!LF(  SAMPLES  I  Z  E  -»=  —  1  )  00 

SFOIN 

TP  :=  80; 

DECIDE: 

I OC  ONT  PPL ( 3 )  ; 
READtSAMDLESI ZF) 

FMD; 

END; 


READ(OIM) ; 

DUMMY 

END. 
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C  0  M  M  P  N  T  *  **  *  *  *  #  *  * 

THE  OAT A  OFCK  IS  TO  BE 

1 )  .  CARD  WITH  the  MUM 
ALAMVSTS. 

2) .  ROM  I . 

3)  .  CAPO  WITH  TH=  NJM 
identifying  INFOP.MATIO 
THIS  IS  ASSIGNED  AS  HE 

4) .  DATA  CARDS  CONIAI 
SPELLING  OF  WORDS.  A 
CONTINUED  ON  THE  NEXT. 

5)  .  CARD  WITH  IN 

SFT. 

6)  .  CARD  WITH  -1,  IF 
NUMBER  OTHER  THAN  ZERO 
WILL  FOLLOW.  SUGGFSTE 
ZFP 0  FOLLOWING  THF  =  1  N 
ONF  THROUGH  "N"  FOR  TH 
FOLLOWING  THE  FINAL  TE 
********************** 


*  ****************  ********************* 
STACKED  AS  FOLLOWS: 

BER  OF  LANGUAGCS  FOR  BASF  DATA 


BEP  OF  CARDS  WHICH  C 
N  ABOUT  THE  BASC  DAT 
ADCOUNT. 

NING  POMI  REPRESENTATIONS 
WORD  CANNOT  BE  STARTED  ON 


ONT  AIN 
A  ELEMCNTS, 


OR  THE  IRA 
ONE  CARD  AMD 


COLUMN  9  TO  SIGNAL  END  °F  DATA  BASE 

NO  MORE  BASE  DATA  SCTS  FOLLOW.  ANY 
WILL  SIGNAL  THAT  ANOTHEP  S=T 
D  SEQUENCE  IS  ONE  through  four  WITH 
AL  BASE  DATA  S  FT »  THEN  THEN  NUMBERS 
E  TEST  DATA  SAMPLES.  WITH  A  -1 
ST  DATA  SET. 

************************************i 


c  qmmpmt#***  *******  ***************************  *********  ****** 
ROMI  'MUST  ALWAYS  BP  THE  PIRST  DATA  SET  FOLLOWING  THE  "  ?D AT  A" 
CAPO  ( JCL) .  THE  NUMBER  "5"  SHOWN  BPLOW  TPLL  HOW  MANY  SETS 
OP  REFERENCE  DATA  WILL  BE  USPD  IN  THE  INITIAL  SPTUP. 

NOTE  THAT  ROf.'I  IS  WRITTEN  SUCH  THAT  EACH  CHARACTER  OF  ROMI 
IS  POLLOWED  IMMEDIATELY  (SEPARATED  BY  CNP  SPACE!  BY  THP 
CATEGORY  TO  WHICH  IT  BELONGS.  TWO  LETTERS  JOINFD  BY  A  "-» 
ARE  READ  AS  A  SINGLE  LFTt-o. 

*********************************************************** ; 
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SDATA 

5 

A 

V 

A-  A 

V 

A-B 

V 

A-E 

\r 

B 

S 

B-B 

F 

B-V 

s 

C 

V 

c-c. 

S 

C-F 

V 

C-H 

F 

0 

R 

D-D 

R 

D-Z 

s 

E 

V 

E-E 

V 

E-I 

V 

E-0 

V 

F 

F 

F-F 

s 

G 

S 

G-A 

V 

G-B 

C 

G-F 

F 

H 

F 

H-H 

F 

H-W 

N 

I 

V 

I  - 1 

V 

I-R 

V 

I-T 

V 

J 

N 

J-A 

F 

J-N 

N 

J-'J 

V 

K 

S 

K-X 

S 

K-Z 

F 

L 

N 

L-L 

N 

M 

N 

M-J 

N 

N 

M 

N- J 

N 

N-N 

N 

N-Q 

N 

G 

V 

0-F 

V 

0-0 

V 

C-'J 

V 

P 

S 

p-c 

S 

Q 

F 

Q-A 

F 

0-0 

S 

R 

N 

P-B 

c 

R-P. 

N 

R-T 

N 

S 

F 

S-H 

F 

S-S 

F 

T 

S 

T-C 

S 

T-H 

F 

T-L 

R 

U 

V 

U-H 

V 

u-o 

V 

J-Q 

F 

V 

F 

V-V 

V 

w 

N 

W-W 

V 

X 

F 

X-X 

F 

V 

V 

Y-U 

N 

Y-y 

V 

z 

F 

Z-C 

F 

Z-H 

F 

l-l 

F 

3- 

-R 

V 

A- !  V/  A-N  F  A-P  V  A-U  V 

C-T  V  C-Q  V 

E-R  V  E-S  V 

G-G  S 

T-U  V 

M-U  R 


T-m  c  t-s  S  T-T  S 
U-S  V  U-T  V  U-U  V 


180 


?DAT  A 

5 

A 

V 

A-A 

V 

A-B 

V 

A-E 

V 

B 

<; 

B-B 

F 

B-V 

c 

C 

V 

c-c 

S 

C-E 

V 

C-H 

F 

0 

S 

0-0 

s 

D-Z 

s 

E 

V 

E-E 

V 

E-I 

V 

E-Q 

V 

F 

F 

F-F 

s 

G 

s 

G-A 

V 

G-B 

F 

G-F 

F 

H 

F 

H-F 

F 

H-W 

N 

I 

V 

I-I 

V 

I-R 

V 

I-T 

V 

J 

N 

J-A 

F 

J-N 

N 

J-U 

V 

K 

<; 

K-X 

c 

K-Z 

F 

L 

N 

L-L 

N 

M 

N 

Y-J 

N 

N 

N 

N-J 

N 

N-N 

N 

N-Q 

N 

0 

V 

0-E 

V 

0-0 

V 

0-U 

V 

P 

S 

P-F 

Q 

F 

Q-A 

F 

Q-Q 

S 

R 

N 

R-S 

F 

R-R 

N 

R-T 

N 

<: 

F 

S-H 

F 

S-S 

F 

T 

S 

T-C 

S 

i-H 

F 

T-L 

S 

U 

V 

U-H 

V 

U-Q 

V 

U-F 

F 

V 

F 

V-V 

V 

w 

N 

W-W 

V 

X 

F 

X-X 

F 

Y 

V 

Y-U 

N 

Y-Y 

V 

Z 

F 

z-c 

F 

Z-H 

F 

Z-Z 

F 

3- 

■R 

V 

A- I  V  A-N  F  A-R  V  A-U  V 

C-I  V  C-Q  V 

E-R  V  c-S  V 

G-G  S 

I-U  V 

N-U  S 


T-M  S  T-S  S  T-T  S 
U-S  V  U-T  V  U-U  V 
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